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Biometrika was founded by Karl Pearson and W. F. R. Weldon in consultation with Francis 
Galton, and the first number appeared in October 1901. Its origin was due to the Royal 
Society requesting that in papers submitted for publication mathematics should be kept 
apart from biological applications, but there is no need to discuss that old disagreement 
now, and anyone who wants to read about it can refer to Pearson’s Life of Francis Galton and 
his memoir of Weldon. The important thing is that Biometrika was founded and is still alive 
after fifty years of activity. As individuals grow older some of their ways of life change, and 
this is true also of journals. In the case of Biometrika there is less biological and anthropo- 
logical work than in many of the earlier numbers, but there are plenty of papers on the 
theoretical side of statistics containing the arithmetical work with which the subject should 
properly be associated. The publication of tables which will be helpful in practical calculation 
has also continued. 

There is another side to all such affairs and it is one which, when it relates to individuals, 
we are apt to be shy of discussing; it is the financial side. In the early days a guarantee fund 
was provided by the founders and their friends, and the two founders not only did a great 
deal of work for the journal but dipped their hands in their pockets from time to time. After 
Weldon’s death in 1906 the practical management of Biometrika was in Karl Pearson’s— 
hands, and he had a good deal of anxiety about it in the early days of the 1914-18 War as the 
circulation fell rapidly. Variations of the arrangements with the Cambridge University Press, 
who have printed the journal ever since it started, were made for reasons of economy, and 
Pearson and Mrs Weldon gave money to help to keep it going. The journal was never run 
for profit but solely for scientific ends. Various auxiliary publications had been issued from 
1914 onwards, such as the Tables for Statisticians and Biometricians, Tables of Gamma and 
Beta Functions, etc. In 1935 a trust was set up by Karl Pearson with not less than five 
trustees, who were to be responsible for the financial side and for sanctioning the reissue of back 
numbers that had gone out of print, the issue of auxiliary publications, etc. The trust deed 
gives the trustees power to appoint a Managing Editor who shall ex officio become a trustee, 
and with his agreement they may appoint assistant editors, but they are not responsible for 
the editorial work, though, of course, they may be consulted if the Managing Editor wishes 
to do so. 

As has been indicated the editorial work in the early days was done by Pearson and Weldon 
(with some assistance from Davenport) in consultation with Galton. From 1906 Pearson 
assumed entire editorial responsibility, though he had some help from W. R. Macdonell and 
from members of his own staff at University College. In 1924 his son, E. 8. Pearson, became 
assistant editor and is now, as readers of the journal know, the Managing Editor, a task which 
he first undertook in 1936. He has, of course, had help in various ways, and recently two 
associate editors have been appointed. Below these notes alist is given of those who have 
served the journal since its foundation; and those who contribute to it or read it may well 
be grateful for the help given in the past. It may be said, in conclusion, that it is the wish of 
everyoue now concerned with Biometrika that it may continue to be of use in the subjects 
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with which it has been associated for so long, that new authors may be added to the list of 
contributors, and that books and tables may continue to be made available for the growing 
number of people who undertake statistical work in so many and such an increasing number 


of directions. 
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1. IyTRoDUCTION 


The aim of the present paper is to derive the maximum-likelihood equations for estimating the 
death-rate among a population of wild animals from data obtained by the capture-recapture 
method. This method, which, in the form it is discussed here, consists of taking a series of 
samples from a constantly changing population, was developed in the first place for the 
analysis of tsetse-fly populations by Jackson (1939, 1948, and earlier papers), who gives various 
methods of estimating not only the number of individuals, but also the birth-rate and the 
death-rate during successive intervals of time. The same procedure has been employed in the 
field by Dowdeswell, Fisher & Ford (1940) in the case of butterflies, and by Fisher & Ford (1947) 
in studying a population of the moth Panaxia dominula over a number of years. These last 
authors describe a method of estimating the numbers and the death-rate, when the latter is 
assumed to have remained constant during the sampling period. In some of the applications 
of this technique to insect populations, the samples were taken at daily or weekly intervals, 
and the series might be continued for a relatively lengthy period of time. When the method 
is applied to populations of small mammals, however, it may not be possible to do this. The 
alternation of breeding and non-breeding seasons in many cases divides the year roughly 
into two sections, which it may be more convenient to study separately. Moreover, it may not 
be possible, for various reasons, to sample the population by means of some live-trapping 
technique at too frequent intervals. Since we were primarily interested in the application 
of the capture-recapture method to populations of this type, we were thus led to consider 
the properties of a sampling chain of a relatively short length, but in which at each sample 
the proportion of recaptures was not necessarily small. Two methods of assembling the 
observed data are discussed in this paper, and the comparative accuracy of the resulting 
estimates is illustrated by the results of a sampling experiment. 
18-2 
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From the theoretical point of view, the essentials of the problem may be put briefly as 
follows. Suppose that a population consisting of an unknown and variable number (M) of 
individuals is sampled at equidistant intervals of time ¢ = 0,1, 2,...,7, and that at each 
sampling R, individuals are captured, marked and returned to the population. (We may 
assume for the present, in order to simplify the theoretical development, that no individuals 
are either accidently killed by the method of capture employed, or removed permanently 
from the population.) Among the R, (¢ = 0, 1, 2,..., 7’) individuals captured and examined 
at time ft, let 


u, = the number unmarked; and 


s, = the number found bearing one or more marks, i.e. the total number of recaptures, 
so that R, = u,4+ 8, 


It is to be noted that the ‘recaptures’ are defined here as the number of individual animals 
recaptured, and not as the total number of marks observed among them. 

Some of these s, recaptures may have been captured only once, some twice, others up to ¢ 
times previously, and in each class all possible combinations of the markst = 0, 1, 2, 3, ... (¢—1) 
may be found. Thus, for example, at the fourth sampling taken at ¢ = 3 the following seven 
classes of marks might occur: 0, 1, 2,01, 02, 12,012. Let 


Yor... = the number of individuals recaptured at time t, bearing the previous marks 
0,1, 2,...,(f—1). 


The problem then is: from the results of such a series of recaptures to determine the 
death-rate, given that the sampling of the population is entirely at random and that the 
method of actually marking or tagging the animals does not affect their subsequent chances 
of recapture. : 

Evidently for this problem to be soluble, at any rate in a comparatively straightforward 
fashion, some simplifying assumptions must be made. Although the population sampled 
will consist possibly of a number of age classes, for each of which the age-specific death-rates 
may be different, nevertheless it is convenient, following Fisher & Ford (1947), to assume 
that over the sampling period 0 to 7’ the death-rate per unit of time remains constant. Let 


P = the constant survival factor, so that out of a total number of individuals N, alive at 
time t, PN, are alive at time ¢+ 1. 


(We are supposing that the succession of samples is taken at equidistant intervals of time, 
so that we may adopt this interval as our unit. If the sampling is at unequal intervals, the 
definitions must be altered somewhat, but no real difficulty arises.) The death-rate in the 
population as a whole over each interval of time is then Q = 1—P. If we assume that this 
death-rate is approximately the same for all age classes in the population, then the expectation 
of life of young individuals, at the age they enter the population at risk of capture, is 


€y = — I/log, P. 


The estimation of the survival factor P is, of course, a necessary and preliminary step in 
the solution of the general problem, which also includes the estimation of the total numbers 
alive in the population at time t. We propose, however, at present to confine our attention to 
this first step and to reserve the discussion of the methods for estimating the total numbers 
for a later paper. 
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It may be shown that the problem of the simultaneous estimation of the N,(¢ = 1, 2, ..., 7’) 
and P reduces to that of first estimating the latter from the recaptures of marked individuals 
only and then using this value to determine the N,. 


2. THE CHAIN OF EXPECTED NUMBERS 


If at the initial sampling R, individuals are captured, marked and released, it is easy, given 
the results of each sampling, to write down the expected number of marked individuals in 
the population falling into the various classes at a particular time ¢. In doing so, it is assumed 
that the constant survival factor P applies equally to all possible subgroups in the population. 
As an illustration, this chain up to ¢ = 3, or the fourth sampling, is given below, where at each 
time ¢ the first column gives the total number of marked individuals expected in the popuia- 
tion as a whole, and the second column represents the number observed in the sample of 
R, taken at that time. The method of constructing the chain of expected numbers is clear 
from the table, each step being determined merely by the results of the previous samplings. 
This is, of course, to adopt a purely deterministic model of the succession of events: 





t= 1 2 3 

PR, ta P?R,—Proy Toe P8Ry—P?ro,—Pros  To3 
Pu, Tis P*u,—Pryip rs 
Pro Toe Pug To3 

P*r9,— Pros o13 

Prop To23 

Pry T1293 

Proie T0123 

Total PR, 4 P?R,+Pu, 8, P3Ro+ P2u,+Pu, 8% 











Now, provided that the size of each sample is relatively small compared with the population 
at risk, which will usually be the case in practice, we may assume that we are dealing with 
a series of multinomial, instead of the hypergeometric distributions which, strictly speaking, 
should be considered in this form of sampling from a finite population. Then, at time ¢, there 
are n = 2'—1 possible classes of marks. If p,, is the expected proportion in the ith class at 
timet (i = 1, 2,...,%; 5 py, = 1) and r,,the number observed in the sample, then the likelihood 

i 


e+ may be taken, apart from a constant factor, as 
eho TT (Pa). 
Similarly for all the remaining samples; the combined likelihood is 
eho HH} (Pi) I ( Pig)" «-. I (Pir), 
and the relevant part of the log likelihood is therefore 
L= > Uru log Pit, 
v4 


from which, by differentiating with respect to P and equating to zero in the usual way, we 
obtain the equation for the maximum-likelihood estimate of P. 
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It will be seen by inspection of this table of expected numbers that in writing down the 
maximum-likelihood equations it is possible by cancelling to reduce the degree of the 
polynomials in P for the expected proportions by one. Thus for t = 2 we have 


ut PR, _ ray ( Uy y" To. i 
PRy+u, PRo+ uy, PRy+ Uy, 


and. L = reg log (PR — 19) — 8g log (PR + U4). 





It is also seen that no estimate can be made of P merely from the results of the first two 
samplings at ¢ = 0 and 1. A minimum of three ‘samples is required for any estimate to be 
made of the survival factor, and it may easily be shown (by supposing that P changes to 
P’ during the last interval) that this estimate refers to the interval of time 0-1. Extending 
the chain further, it follows that the value of P, estimated from the distributions of recaptures, 
refers to the span of time between the original and penultimate sampling, and that no estimate 
can be made of the survival factor during the last interval in the chain. 


3. THE EFFECT OF GROUPING ON THE AMOUNT OF INFORMATION 


In the type of data which commonly arises in practice, when the size of sample may be small 
relative to the total population at risk at some particular time, the number of recaptures is 
likely also to be small. A set of frequency distributions of the rp,._, fort = 0,1, 2,..., 7 would 
then consist largely of zeros or small numbers in the various subclasses. Moreover, the number 
of possible classes of marks at time t,n = 2'‘— 1, becomes very large as the length of the chain 
increases. It is therefore convenient, in order to simplify the actual computations, to adopt 
some system of grouping the observations. 

This might be done in a number of different ways, and the interesting question then arises 
of the loss of information involved in adopting a particular method of grouping. Clearly 
the method of choice would be one which both simplified the equations by reducing the 
number of subclasses, and at the same time minimized any loss of information. 

The method adopted by Jackson (1939, 1948) and by Fisher & Ford (1947) is the following. 
Essentially it consists in forming the distribution of marks observed among the recaptured 
animals, each individual being counted as many times as it has marks. Thus, from the 
observed ro;.__; at time ¢ we form the sums 


~ Tore...t = Ky, say, 


~ T012...t°°° ky 


where k,, = the number of individuals captured at time ¢ who received a mark at time z. 
For example, it will be seen from the table given in the previous section that for the fourth 
sampling at ¢ = 3, k 

03 = 93 +7013 + 023 + Tor23; 
kyg = "13 + To13 + 7123 + Tor235 


keg = 23 +023 +1123 + Tor23» 


the excess of the total number of marks over the number of marked individuals (s,) being 
Tos + Toes + 1123 + 2% o123- 





~ 
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This system of grouping will be termed Method A. 


t-1 
Putting ¥ k= s;, the total number of previous marks found among the R, individuals 
z=0 


captured, and carrying out the same series of summations for the population numbers, we 
have the following modified table. (As before, the first column at each time ¢ is the total 


number expected in the population as a whole, and the second is the number observed in 
the sample.) 


Method A 
t= 1 2 3 + 
PR, ko P*R, Koo P3R, Kes P4R, kos 
PR, kg P*R, kas PR, kaa 
PR, hes P*R, kas 
PR; esy 





TotalPR, s, P®Rj)+PR, 8, P%R,+P?R,+PR, s, P*R)+P*R,+P*R,+PR, 5% 





Evidently this method of grouping results in a considerable simplification of the expectation 
values. It may be shown, however, that in the long run it results in a loss of information.* 

The shortest chain from which we may make an estimate of P consists of the first three 
samplings at ¢ = 0,1 and 2. From the above table we have for the relevant part of the log 


likelihood L = kg log P — 8, log (PRy+ R,), 


| Log 8 RB 
— ~ 9P2 = P2~ (PR, +R’ 


Replacing ko. by its expected value the total amount of information, yielded by the data 
for a three-point sampling grouped by Method A, is therefore 


ba 
4 = P(PR + Ry 


* Another way of considering this method of grouping is the following. At time ¢, R, marks are 
returned to the population. Assuming a purely deterministic model, the number of these ‘t’ marks 
which may be expected to survive until ¢+a is P*R,. In this way the distribution of the expected 
number of marks at the time of some particular sample may easily be written down. We then assume 
that we have a random sample of this population of marks, and that the probability of obtaining the 
observed result is given by the appropriate term of a multinomial distribution. In order to estimate 
the unknown parameter P, we then take the combined likelihood of the series of results and form the 
maximum likelihood equations in the usual way. 

It has been pointed out to us, however, that the actual sampling is not quite in accordance with this 
model. Each sample consists of individual animals, and any one animal may bear more than one 
mark. The marks in the different categories do not therefore appear to be free variables, since a number 
of individual marks may necessarily be associated together on the same animal at the time of sampling. 
Under these conditions is this use of the theory of maximum likelihood to obtain an asymptotic variance 
justifiable? 

This is rather a complicated question on which we du not feel qualified to express an opinion. The 
difficulty arises when the number of recaptured animals bearing multiple marks becomes appreciable 
compared with those bearing only one mark. In practice, we should not, ourselves, adopt this method 
of grouping in such cases, but rather the system described later in the text as Method B. Nevertheless, 
we quite agree that there still remains this interesting theoretical question for consideration. 
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Taking the first three samplings as set out for the ungrouped data in the table given in the 
previous section, we have 
L = 192 log (PRy— 19,) — 82 log (PRo + uy), 


and PL _ rea 8 RG 
OP? (PRy—1%)? (PRy+%)?” 





Thus, for the same set of data when ungrouped, the total amount of information is 


ae 


(PRy—1o,) (PRo+u,)?” 


But 8; = 8,+79,. and the expected value 


I= 





Sof 
E(ro2) = ee ; 


so that, for large samples, z(2) i Ro +R, 





8) PRot+u," 
I, — (PRo—1% 1) (PRo + Ry — 193) 
Hence + jan PR\(PR,+R,) <1. 


Thus, in the case of large samples, there will be a loss of information, even in the simplest 
chain, through grouping the results in terms of marks. 

The loss of information in this simple case appears to arise owing to the fact that in 
grouping the observations in terms of marks, the expectations become functions merely of 
the total numbers caught at the previous samplings, whereas in terms of the different classes 
of marked individuals, they are functions both of the total numbers caught and of the 
frequencies observed in the various subclasses at previous samplings. Thus, in the latter 
case, the information that ry, individuals were recaptured at ¢ = 1 is retained in the expecta- 
tions for the sampling at t = 2, whereas it is lost in the former method of grouping. If this 
suggestion is correct, we might therefore expect that in a more extended chain of samplings 
the loss of information through grouping the observations in terms of marks would become 
progressively greater. In fact, it was partly in order to see whether this hypothetical increase 
in the loss of information was realized in practice that the sampling experiment described 
later was carried out. 


4. A METHOD OF GROUPING INVOLVING NO LOSS OF INFORMATION 


When the data are not grouped in any way, the expected numbers in the total population 
falling into the various subclasses of marked individuals are polynomial functions of the 
unknown P. Another method of grouping which immediately suggests itself is to sum the 
various classes at each sampling according to the degree of these polynomials. Thus it will 
be seen from the table given in §2 that at ¢ = 3, for example, there is only one third-degree 
polynomial; but there is a pair of second degree which may be added together, and similarly 
for the four single-degree terms. Examining the suffixes of the observed numbers recaptured, 
it will be seen that this procedure is equivalent to grouping the recaptures according to the 
interval of time since they were last captured. 
This system of grouping will be termed Method B. 





To 
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e If we define m,, as the number of individuals captured at time t which were last captured 


t-1 
at time x(x = 0,1, 2,...,(¢—1); © my = 8) we have the following table: 
z=0 


; Method B 
{= 1 2 3 4 
PR, ™, P?Ry—Pmg, mM P8Ry—P?mMo,—Pim. M3 PR, — P8mg, — P?mg.— Pog M4 
PR, My P*R, —Pm,, M13 P®R, — P?m,,— Pm My 
PR, Mog P?R, — Pg Mog 
| PR, Moq 





Total PR, s, P*R,+Pu, 8 PR, + P2u,+ Pu, 83 P4R, + P8u, + Pu, + Pug 8, 





It may be shown that there is no loss of information through grouping the observations 
in this way. That is to say, defining the log likelihood for the ungrouped data as L, and that 
for the above table as Lz, ifin — 6*L,,/0P? and — 0?L/¢P? we substitute the expected value in 
each class for the corresponding observed value, then J, = J. 

It is easily seen that — 0?L/¢P? will differ from —0?Z,/0P? only in respect of those terms 
which are represented in the latter by the summation of the polynomials, the remaining 














* terms being the same. Thus, for example, the only difference between the contributions to the 
j total amount of information arising from the sampling at ¢ = 3 will be that there is a term in 

i Lp i a ll M3 Ry 

of oP ~ **(PR,—m,)*"”’ 

8 

“ i corresponding to a pair of terms in 

aj eL Sins 3%} Tois01 

i : OP?" (Puy —ry2)? (Pro — Pore) 

is | 

8 As a first step, if we replace the observed 79,71. and 79,2 by their corresponding expected 

eC values, we find that 

3e R, ERS. AD yet You = PRo+% 

d PRi-—m™M .  Puy—t: Prato, P?Ro+Puy—s9’ 


and hence, since E(r,3) + E (79:3) = Z(m, 3), the contribution to the total amount of information 
will be the same in each case. By applying a similar type of argument to the remaining terms 
which differ in —0?L/0P? and —0?L,/0P?, it follows that J, = I. 


n Thus, the method of grouping the recaptures according to the interval of time since they 
e were last captured fulfils the requirements mentioned earlier. The subclasses which have to 
e | be considered are reduced in number, an obvious advantage in the case of a long chain of 
ll samplings, and in addition there is no loss of information. Thus, compared with the method 
€ of grouping by marks which is commonly employed, the present method should provide 
y a more accurate estimate of P for a given: set of data. 

1, Various other methods of grouping the data were tried, but none of these proved so 
e convenient as Method B. Either the maximum-likelihood equations were more troublesome 


{ to solve arithmetically, or there was a loss of information which varied in amount. One 
method is perhaps worth special mention, since, like Method A, it is attractive in so far as it 
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results in very simple expressions for the expected numbers. This is to group the recaptures 
according to the time they were first captured and marked. However, this, which we may 
call Method C, appears to result in a greater loss of information even than Method A. Thus, 
to take the simple three-point sampling as an example, we have 


Le = (71 + Tor2) log P — 8g log (PRy + uy), 


8,PR, 


and the expected number E(ro, +%o12) = PR+4u,’ 
ot Uy 


from which we have in relation to the total amount of information for the ungrouped data, 


Now, from the relationship given in §3, we can write 


I, I, (.- To. eo 
PR,+ Ry, 





yor 


and hence Ig <1 < Ip. 


5. SOLUTION OF THE EQUATIONS FOR ESTIMATING THE SURVIVAL FACTOR 


The solutions of the maximum-likelihood equations for the two methods of grouping the 
data are given here for a chain of five samplings. The way of extending the equations for 
a chain of greater length will be easily seen from the form of these solutions. In order to save 
space, the actual equations for L are not given, since these may be written down at sight 
from the tables in the previous sections. 


A. The method of grouping by marks 


i>] 


ede 
p (Koz + 2kog + kyg + 3kog + 2ky4 + Koy) 


>) 





_ fF _82R . 83(2PR, + R,) “ 84(3P?R, + 2PR, + Re) 
PR,+R, P?R,+PR,+R, P®R,+ PPR, +PR,+R8. 
= 0. 
In order to simplify the actual computations, this equation may be multiplied throughout 
by P and rewritten as 





7) — | ep Se PRD GAPS OP PRT 6 
()—| pap + P*R, * P8R,+P2R, + PR,’ P9R,+ P2R,+PR,+ Py)” 


where, setting out the observed distributions of marks in a convenient form of triangular 
diagram, 


S(k) = kog + 2kog + Skog 
+ kyg+2ky, 
+ Key. 


It is not difficult to show by a little straightforward algebra that the form of solution 
illustrated by Fisher & Ford (1947) on several sets of observed data reduces to that given 
here. The estimate P, together with its variance, may be obtained by iteration, anda numerical 
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example of the various steps in the solution is given in Appendix I. In solving this equation 
by the iterative method, however, it is most important to remember that for a trial value 
P,, we are actually calculating the value of P,0L/0P,. It is therefore advisable to obtain by 
division 0L/0P,, 0L/0P,, etc., before interpolating for the estimate P which makes oL/oP = 0. 
Although this correction may have only a trivial effect on the estimate P, it becomes of 
importance in estimating the variance. For, if as a result of the iteration, we find two values, 
P, < P,, between which the required value Pp lies, and if 0L/0P, = x, say, and 0L/0P, = —y, 
then we may take 1 re 


VP) ~~ aP** P,P, 
By neglecting the fact that owing to the rearrangement of the original equation we are 
actually calculating PoL/eP for trial values of P, we should therefore overestimate the 
variance unless this correction was made. 
It will be seen that for the shortest possible chain, namely, a three-point sampling, the 
equation may be solved directly for P. Thus we have 


A st Nee dt 
P dé 62 2 
ky2Ro 


the variance of this estimate being (in terms of the observed values) 





. A 
8, P? 
koe ky. 


But, for any chain of greater length, the iterative solution is the most convenient. 


V(P) = 





B. The method of grouping the recaptures according to the interval of time since they 
were last captured 


Setting out the solution in the same triangular form as that used for the table of expected 
and observed values given in the previous section, we have 


oL ath. Ry Mo3(2P Ro — mo) 4 Moq(3P?Ro — 2PM, — Mop) 














+ m4, R, + My4(2PR, — M43) 
PR,— my. P?R, — Pm, — m3 
Mo, hry 
+ PR,— mag 
_F_82Ro  , 85(2PRo+%) 8,(3P?R, + 2Pu, + Ue) 
PRy+u, ' P?Ry + Puy + Ug PR, + P2u, + Pugt uy 


= 0, 


Just as in the case of Method A, this equation, after a little rearrangement, may most easily 
be solved by iteration, and a numerical example of the solution is given in Appendix I. 
Although the type of calculation is very similar to that for Method A, the whole procedure 
takes longer. Compared with the time which must be spent in collecting the original data, 
however, any increase in time spent in computing is trivial, if a more accurate estimate of 
a particular parameter is obtained. 
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For the simple three-point sampling, we have the direct solution 


Pp a Moarsat ti Moa 
= 3 
M2Ro 


and the variance (in terms of the observed values) 


2 
D\ _ Moo 8 RF 


re) “at 





6. THE DESIGN OF THE SAMPLING EXPERIMENT 


It has been shown in §3 that the method of grouping defined here as Method A results, at 
any rate in the case of large samples, in a loss of information. This has been proved in the case 
of the shortest possible chain, namely, that given by the first three samplings at ¢ = 0, 
1 and 2, and it has been suggested that this loss of information might be expected to become 
greater as the length of chain increases. The question then naturally arises as to what we 
mean by ‘large samples’ in the case of a chain of such events. In the type of data which 
might be met with in practice, would there be in fact any advantage in adopting Method B 
instead of A? 

It seems likely that in the type of field data such as that given by Fisher & Ford (1947) 
there would be very little to choose between the two methods, since their multiple-recapture 
rate was evidently very low and in this case the results would probably be much the same 
with either method. In such cases it may be preferable to group the data by Method A, 
simply owing to the greater ease with which the maximum-likelihood equation can be solved. 
But if the recapture rate be greater, there is an increased chance of multiple recaptures, and 
then the two ways of tabulating the data would become very different. Under these con- 
ditions the greater degree of precision expected in the estimates of the survival factor made 
from data grouped according to Method B might well be realized in practice. 

Another closely allied problem of some practical interest is the following. Various species 
of small maminals, for example, the vole Microtus agrestis, pass the winter months in a non- 
breeding phase. It may be convenient, therefore, to carry out a series of live trappings during 
this period, since no dilution of the population is occurring through births. Suppose then 
that we wish to estimate the death-rate and numbers during such a period of, say, 5 months. 
Each sampling of the population requires not only a considerable expenditure of time and 
trouble over the purely technical side of live-trapping in the field, but also, in some cases, 
a special journey to the particular area studied. The question of the total cost of each sampling 
in relation to the amount of information obtained thus becomes important. Now, the 
minimum number of samples required in order to obtain an estimate of the death-rate is 
three. For a total period of 5 months we might therefore sample during every other month. 
Suppose that approximately the same number of animals can be captured and marked at 
each sampling, and that the cost in time, trouble and so forth, is constant, say x units per 
sample. If we increase the number of samples to five within the period, i.e. sampling at 
monthly intervals, the total cost increases in the ratio 5 : 3, or 1-67 times. Does the increase 
in the amount of information in regard to the various parameters which we wish to determine 
justify the additional labour and expense? 

In order to investigate these points we decided to carry out an artificial sampling experi- 
ment in the laboratory. The scale of the experiment, the number and size of samples, and 
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the type of population were suggested by the analysis of some field data for the vole, Microtus 
agrestis, which were collected during the winter of 1937-8. On the area studied it was possible 
to live-trap and mark around fifty individuals at each sampling, and later calculations 
suggested that the size of this particular population at the beginning of the winter trappings 
was of the order of 300 individuals. The death-rate, on the average, appeared to be roughly 
20 % per month. We were thus led to consider the following imaginary case. 

A population of 300 individuals at some initial date decreases owing to deaths for a period 
of four units of time. A total of five samplings, including the initial one, are made at equi- 
distant intervals, at each of which fifty individuals are withdrawn, marked and returned to 
the population. No births occur; immigration and emigration can be neglected; and a con- 
stant death-rate is assumed to be in operation during the whole period. The actual numbers 
alive (N,) in the population at each of the successive samplings and the number of deaths 
(D,) occurring during each interval, were taken to be: 








t N, D, P, 
0 300 48 0-8400 
1 252 40 0-8413 
2 212 34 0-8396 
3 178 28 0-8427 
4 150 «< ja 




















The constant survival factor was assumed to be P = 0-84, corresponding to an instantaneous 
death-rate per head of population d = —log, P = 0-194, but owing to the necessity of having 
integral values of N,, the actual values of P varied slightly for each interval. The average 
value over the interval of time 0-3, which is the most that can be determined from a total of 
five samplings, is P = 0-8403. Since the estimates which are made later are expressed only 
to the third decimal place, the last figure being in error, we may take, for all ordinary purposes, 
the expected value as being P = 0-840. 

The individuals consisted of the counters which are made for the game of ‘Lotto’. Each 
counter is a wooden, cylindrical disk of diameter 15mm. and height 8mm. with a number 
impressed on one of the two surfaces. They can be bought in numbered batches of ninety, 
and in order to distinguish different batches, they may be stained in dilute solutions of various 
coloured inks. Each individual counter can thus be defined by a colour and number. The 
shuffling and mixing of the counters before a sample was taken was done in a tin drum 
(diameter 15cm., height 26cm.) lined with felt and provided with an opening, closed by 
a sliding plate, on one side, for removing or returning the counters. This drum was fixed to 
a horizontal bar provided at one end with a turning handle and mounted in a wooden frame. 
This horizontal bar passed through the diagonal axis of the drum, so that when the drum 
was rotated the counters were also thrown from one end to the other. We are indebted to 
Mr D. A. Kempson for designing and making this apparatus. 

Each chain of samplings was carried out in the following way. Starting with 300 counters 
which had been thoroughly mixed, a sample of fifty was taken, the individual counters being 
ticked off on a list according to their colour and number. This sample was returned and the 
total of 300 counters was again mixed by about twenty slow revolutions of the drum. 
Forty-eight counters were then taken at random and discarded, to represent the number of 
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deaths occurring during the first interval of time, and a further fifty counters were removed, 
ticked off on a list, and returned to the drum. The latter was again rotated to mix the counters, 
forty were removed to represent the deaths during the second interval, and a further fifty 
withdrawn for the sample at t = 2. This sequence of shufflings and withdrawals was continued 
until the fifth or last sampling of fifty counters at t = 4. 

The whole chain of five samplings was repeated twenty times. 


7. PRELIMINARY ANALYSIS OF THE DATA 


Before discussing the results of this experiment there are two points of some importance 
which must be mentioned. The first of these is the question whether the drawings were in fact 
random samples of the population. The chief difficulty we had in mind was whether there 
was a thorough mixing of such shaped counters after a reasonable number of revolutions— 
say about twenty—of the drum. Before embarking on the real experiment, therefore, 
a perfectly straightforward series of random samplings was made from a constant population. 
It is unnecessary to describe here in detail the analysis of these preliminary data; in all the 
tests which were applied the results were satisfactory, as judged by the usual statistical 
criteria. A further indication that the mixing and sampling were reasonably satisfactory is 
given by the results from the twenty replicate drawings from the population considered here. 

Knowing the actual number of individuals in the population it is easy to calculate the 
expected number which should on the average occur in any particular class of marked 
individuals at each su¢cessive sample. These expected numbers are given in the first column 
of Table 1 for the data grouped according to Method B, and it will be seen from the second 
column that the observed means of the twenty replicates follow these very closely. (See also 
Appendix III, Table 3, where the results for the ungrouped data are given.) Moreover, if we 
include the unmarked class at each sample, there were four observed frequency distributions 
for each replicate chain, which could be compared with the expected frequencies by means 
of x*. When the data are grouped by Method B there are two classes at the first resampling, 
three at the second, and so on, so that the degrees of freedom are ¢ for the appropriate dis- 
tribution. The following were the sums of the twenty individual values of x? at each sampling 
in the chain. None of these values is excessive; in fact, the agreement with expectation is, if 
anything, rather too good, but not exceptionally so, at the last sample. These results suggest 
very strongly that the shuffling of the counters between each drawing was satisfactory, since 


we might expect that any bias due to insufficient mixing would have a cumulative effect in 
such a chain. 








t + D.F. = 20¢ (2x?) — /(40¢ — 1) 
1 19-43 20 — 

2 48-35 40 + 0-946 

3 65-08 60 + 0-499 

4 67-39 80 — 1-000 




















Although the values of y? thus appear to be satisfactory, there is an important point which 
emerges when we consider the observed variance in each class of marked individuals. In the 
theoretical development of the first few sections we have adopted, in the first place, a purely 
deterministic model in which the death-rate falls equally on all subclasses of the population, 
and secondly, we have assumed that at each sampling the probability of getting the observed 
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result is given by a multinomial distribution. Thus, for example, at the first resampling when 
there are only two classes in the population at risk, the probability of observing mp), marked 
and u, unmarked individuals in a sample of R, is taken as 


R. 


! 
1° moi( 1 — p)“1, 
mole” a(1—p) 


Table 1. The means and variances of the numbers falling into the various classes of marked 
individuals, when the data for the twenty replicates were grouped by Method B 














Variance 
Mean number of counters 
aenaaas Expected o? 
s? (19 p.F.) 

Expected Observed Multinomial |Hypergeometric 
Moy 8-333 8-10 6-944 5-589 7-04 
Moe 6-680 6-90 5-788 4-445 6-94 
My 9-920 9-75 7952 6-107 11-35 
Mog 5-105 5-45 4-584 3-315 5-94 
M43 7-581 7:30 6-432 4-651 7-17 
Mog 11-792 11-65 9-011 6-516 7-61 
Mo4 3-671 3°35 3-402 2-283 2-66 
My 5-451 5-10 4-857 3-260 6-09 
Mo, 8-480 8-90 7-042 4-726 4-52 
Msq 14-045 13-75 10-100 6-778 7:99 


























Now, strictly speaking, the appropriate distribution to be considered at each sampling is 
a hypergeometric of two or more classes. The error introduced by assuming a multinomial 
distribution is negligible when the size of the sample is small relative to the total number in 
the population sampled. But in this experiment the samples ranged from 17 to 33 % of the 
population at risk, and we should expect the observed variances to be closer to those expected 
for hypergeometric rather than multinomial distributions. 

If p; is the proportion of the population falling into the ith class, the expected number 
E(m,) in this class in a sample of R is for both types of distribution 


E(m;) = p,R. 
The expected variance of m;is V(m,) = p,1—p,)R 
in the case of a multinomial distribution, while for the hypergeometric 


Vim) = (1p) B(—7) 
where N is the total number in the population. 

These expected variances are tabulated in the third and fourth columns of Table 1, the 
observed s? (19D.F. in each case) being given in the last column. It will be seen that in nine 
cases out of ten the observed variance is greater than that expected for a hypergeometric 
distribution, and that in fact they approximate.much more closely to the multinomial 
variances. It seems likely that this discrepancy arises owing to the way in which the experi- 
ment was carried out. Thus, to take as an example the first interval of time, the removal of 
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forty-eight counters chosen at random from the 300 in the drum to represent the number of 
deaths occurring during the interval introduces an additional random element. If there are 
(N — Ry) unmarked and R, marked individuals, we should not expect to remove precisely 
0-16 (N — R,) and 0-16 R, counters by this procedure. The assumption that at ¢ = 1 we are 
withdrawing a sample from a population in which the death-rate has fallen equally on the 
marked and unmarked class is thus not fulfilled. We really require some compound probability 
distribution for the expected numbers at the first resampling, and similarly for all the rest 
of the sequence. It might be expected that the general effect of this additional random 
element would be to increase the variance of the observed numbers occurring in the sub- 
classes at each sampling, and it is suggested here that the excess of the observed variances 
in Table 1 over those expected for a hypergeometric distribution is due to this cause. 

The point raised here may be of considerable practical importance, since in the case of 
a biological population the death-rate during some interval of time must be regarded as 
a probability rather than as a constant parameter. Strictly speaking, some appropriate form 
of stochastic model is needed to describe the changes in the population and the expected 
distribution of marked individuals at each sampling. The chief difficulty which may arise 
from the use of a deterministic model is in regard to the variances of any estimates which 
we may make. If an estimate of the variance of some parameter is derived from the maximum- 
likelihood equation for this type of model, it is possible that this may be an underestimate of 
the true sampling variance, more especially when the number of recaptures is small. Although, 
as will be seen from the results given in the next section, the variances observed between 
our twenty replicates are very close to those expected theoretically under the assumptions 
which have been made, this agreement may only be due to the scale on which this experiment 
was carried out. For smaller samples, or for a smaller proportion of the population sampled, 
the results might not have been so satisfactory. However, we shall be returning to this point 
in a later section. 

8. RESULTS OF THE SAMPLING EXPERIMENT 

The following figures for the experimental population show how the total amount of informa- 
tion in regard to P should increase with the length of the sampling chain. These values of 
Ip were obtained by inserting the expected numbers in the appropriate equations for 0?L/0P?. 
Thus, the amount of information increases very rapidly with the length of chain, and the 











Ip for data grouped by 
Chain 
ending at ¢ 100 A/B 
Method A Method B 
2 6-41 8-79 73 
3 34-34 53-74 64 
4 110-38 203-29 54 




















loss of efficiency through grouping the data by Method A becomes progressively greater. 
The efficiency of A is only 54 % for a chain of five samplings. Naturally, it is difficult to make 
any general statements from figures which are based on a specific numerical example, but 
roughly we should expect much the same comparative degree of efficiency in any non- 
breeding population of this type, provided that our samples remained around 20 % of the 
individuals at risk. 
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The further problem considered in §6 was: given a limited period of time 7’ between the 


first and last samples in a chain, whether the increase in the amount of information through 
increasing the number of samples from the minimum of three, to a total of five, is sufficient 
to justify the additional labour and expense. In the present example we obtained the 
following expected figures. 











No. of Ip + 1000/P 
samples 
for the 
period 0-T' Method A Method B A B 
3 27-64 40-09 + 22-6 + 18-8 
5 110-38 203-29 +11-3 + 84 























Thus, taking B as the most efficient method of grouping, the average amount of information 
per sample should increase from 40/3 = 13-3 to 203/5 = 40-6, or roughly three times. If the 
average cost of each sampling remains constant, the two additional samples are therefore 
most repaying. We should expect the percentage error of our estimate of the survival factor 
(P = 0-84) to decrease from about 19 to 8-4, which is quite a marked increase in precision. 

It is interesting to see how far these expected figures were realized in the sampling 
experiment. The estimates of P are given in Table 2 for each of the twenty replicate drawings 
BG inept 


Three-point sampling 


In this series only the samples taken at t = 0, 2 and 4 were considered, and in the first two 


columns of Table 2 the estimates P are expressed per unit of time. (The values of P over 
the period 0-2 were calculated directly from the equations for a three-point sampling given 
in §5, pp. 277 and 278.) It will be seen that the two methods of grouping gave very similar 
results. On twelve occasions out of twenty the estimate from Method B was nearer the true 
value of P = 0-84; but the means and variances for the twenty replicates were almost 
identical. The two means are slightly below the true value, but the difference is not particularly 
marked, since the standard error is in both cases of the order + 0-035. Evidently there is 
little to choose between the two methods of grouping, as judged by these data, in the case of 
a three-point sampling. There is no evidence of any marked bias in the estimates, and the 
differences between the observed and expected variances are no greater than those which 
might easily occur in random samples of twenty. But, although in one sense these agreements 
with expectation are satisfactory, the general run of the results is not very promising from 
the point of view of the biologist. Actually, in three drawings out of twenty, we obtained an 
estimate of P which was greater than unity. If any one of these three drawings happened to be 
the result of a single experiment in the field, this would lead to the nonsensical conclusion 
that a negative death-rate was in operation.* In other cases we should grossly overestimate 
the death-rate. Clearly, a three-point sampling of around 20 % of a population consisting 


* It is to be noted that in the mathematical development no definition of the symbol P has been 
made restricting it to values <1. Exactly the same equations would arise in the imaginary case of 
a population increasing, for instance, by division, in which any parent possessed the peculiar property 
of handing on to the offspring the same combination of marks it happened to have at the moment of 
division. 

Biometrika 38 19 
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of a few hundred individuals is not a very efficient method of estimating this parameter. 
Nevertheless, as we hope to show in a later paper, the results of such a chain may still be used 
to obtain an estimate of the total numbers, which will very often be sufficiently accurate, if 
we are interested merely in the order of magnitude of the population sampled. 


Five-point sampling 

The estimates of P were obtained in both series by the iterative solution of the equations 
given in §5. In the case of Method A, the solutions in this experiment were greatly simplified 
by the fact that the number of individuals marked was the same at each sampling. Hence, 
as will be seen by putting R, = R, = ... = Rp in the appropriate equation, a series of tables 
for trial values of P could be prepared. (This property of the equation for estimating P from 
the data grouped by Method A may possibly be of use in the design of field experiments, 
whenever the size of each sample can be fixed in advance and a large amount of data is to be 
collected and analysed.) No great accuracy in the solution was attempted, and an interval 
of 0-05 in the argument was adopted for the tables, the final estimate being made by simple 
linear interpolation between the two nearest trial values. No tables could be prepared for 
the solution of the equation for Method B. Generally speaking, the same interval of 0-05 
between trial values of P was adopted in the iteration, though occasionally if the first value 
tried gave an answer very close to the solution, a smaller interval would be chosen. The 
estimates of I, = — 0*L/0P? obtained from the iteration are given in Table 2 for both series, 
the reciprocals of these values being, of course, the estimated variance in each case. Although 
no great accuracy was attempted in the iteration, it is interesting to see that, in both series, 
the mean of these estimates of J; is very close to the expected value. 

It will be seen from the results given in Table 2 that the expected difference between the 
two methods of grouping is more fully realized in the five-point sampling. In only three cases 
out of twenty was the estimate of P from the data grouped by Method A closer to the true 
value of P = 0-84, and in one of these cases the difference between the two methods was 
trifling. On the average the estimates for Method B were nearer the true value by an amount 
equal to 0-0171+0-0072. This greater degree of precision is represented in the observed 
variances between replicates, which were s* = 0-004227 for Method B, compared with 
8* = 0-007647 for Method A. Thus, judging from these results, the efficiency of A would be 
reckoned as about 55 %, a figure which is remarkably close to the theoretical one of 54 %. 

Both of the observed variances are somewhat less than those expected. The discrepancy, 
however, is not at all marked and is of a degree which might very well arise in a random sample 
of twenty observations (x? = 16-0 and 16-3 for series A and B respectively, there being 
19p.¥.). Out of interest, we also calculated the expected variance, when allowance was made 
for the size of the samples relative to the population at risk, by multiplying the contribution 
to 0*L/¢P? at each sampling by the appropriate factor (N,—1)/(N,— R,). It was found in this 
way that for the data grouped according to Method B the expected o? = 0-003373, a figure 
not very much below that of o? = 0-004919 for an infinite population. The observed variance 
between replicates, s? = 0-004227, is compatible with either value. 

The results of this experiment, therefore, bear out the theoretical expectations. In sampling 
for a limited period of time on this scale from a population of a few hundred individuals, the 
increased precision obtained through increasing the length of the sampling chain is apparent. 
It is evident, too, that the method of grouping the recaptures according to the interval of 
time since they were last captured is preferable to that of grouping by marks. Under 
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conditions similar to those in the experiment, the error of an estimate Pis thereby reduced 
to around 8 % for a chain of five samples. The agreement between the observed inter-replicate 
variances, both for the three- and the five-point samplings, with those expected for large 
samples from a deterministic model population is of particular interest. Keeping constant 
the proportion of the population sampled, and allowing the size of the samples (R) to increase, 
we should therefore expect the variance of an estimate P to be 50/R times the values given 
here. Thus, the variance would be halved for samples of 100 from a similar population of 
twice the size. The behaviour of the variance, if we were either to decrease the size of the 
samples, or to sample a smaller proportion of the population at risk, needs further considera- 
tion however. We may inquire what the results of this experiment would have been if we 
had taken a much smaller number of counters at each sampling of the population. Would 
the variance between replicates then have been in so close agreement with expectation? 


Table 2. Estimates of the survival factor P from the data grouped by Methods A and B 
(Expected valus, P = 0-840) 

















Three-point sampling Five-point sampling 
Replicate Method A Method B Method A Method B 
A A A A 
P P P Is P i. 
A 0-775 0-821 0-744 161 0-774 263 
B 0-620 0-609 0-725 138 0-784 263 
Cc 0-798 0-702 0-809 93 0-874 199 
D 0-782 0-665 0-995 105 0-912 211 
E 0-646 0-719 0-774 116 0-816 199 
F 0-832 0-858 0-931 89 0-917 127 
G 1-080 1-161 0-816 101 0-849 284 
H 0-853 0-784 0-852 98 0-847 196 
z 0-866 0-898 0-899 100 0-872 153 
J 0-935 0-930 0-845 114 0-884 223 
K 0-707 0-733 0-697 131 0-699 230 
L 1-069 1-058 0-863 9i 0-862 175 
M 0-707 0-721 0-811 98 0-837 201 
N 0-798 0-710 0-834 116 0-836 192 
O 0-598 0-602 0-793 138 0-798 241 
P 0-598 0-674 0-721 137 0-738 236 
Q 0-961 0-869 0-994 86 0-942 189 
R 0-775 0-792 0-907 93 0-884 179 
8 0-817 0-850 0-708 144 0-751 303 
T 1-118 1-107 0-843 103 0-904 176 
Mean 0-8131 0-8167 0-8280 112-6 0-8390 212-0 
s* (19 D.F.) 0-02597 0-02475 0-007647 — 0004227 - 
Expected 0-03618 0-02494 0-009058 110-4 0-004919 203-3 
o* or Ip 





























9. THE EFFECT OF REDUCING THE SIZE OF SAMPLE 


An attempt was made to answer this question from the data already collected in the experi- 
ment, by supposing each sample to have been half the original size. As we have indicated 


19-2 
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in §6, the individual counters forming each sample of fifty were ticked off on a list of the 
whole 300 counters used in the experiment. These lists were gone through, and the entry 
for every alternate counter in each sample was marked with a circle. Considering in the 
first place only the entries marked with a circle, we then had a chain of five samples of 
twenty-five for each of the original twenty replicates, and the results could be tabulated in 
any way which seemed the most convenient. Similarly, by considering the entries which 
were not marked with a circle, we could obtain another set of twenty chains. For each of the 
replicates A, B,C, ...,T, we therefore had a pair of chains, each consisting of five samples of 
twenty-five, from the original population. 

The recaptures in each chain were then grouped by Method A, simply in order to reduce 
the labour of estimating the forty values of P. For a five-point sampling, in which each 
sample consisted of twenty-five counters, the expected number of marks in the various 
k,, classes for this method of grouping ranged from 2-08 to 3-51, compared with values four 
times as great in the case of the original samples of fifty from this population. Thus, by 
halving the size of sample, we have reduced the expectations to relatively small numbers. 
The expected variance of P from the maximum-likelihood equation was 0? = 0-0362, which 
is the same as the expected variance for a three-point sampling of fifty for the original data 
grouped by Method A (see §8, Table 2). 

The mean of the forty estimates was P = 0-864. We then assumed that we had a pair of 
independent estimates for eaclt of the twenty replicates (the correlation between the twenty 
pairs was r = + 0-10) and carried out an analysis of variance in the usual way. The following 
were the results, together with the values of y? obtained by contrasting each mean square 
with o?: 











D.F. Mean square - - 
Between replicates 19 0-06632 34:°8 <0-:02>0-01 
Within replicates 20 0-05412 29-9 <0-10> 0-05 
Total 39 0-06006 























These variances are greater than o* = 0-0362; indeed, the discrepancy for the ‘between 
replicates’ s*, which is the comparable variance to those calculated in the previous section, 
would be reckoned as significant. In addition, the values of If were calculated from the 
iteration, and the reciprocal of the mean of these forty values was 0-0337. On the average, 
therefore, the individual estimates of the variance tended to be less than the observed 
variance between replicates. 

The results of this experiment to investigate the effect of reducing the size of sample are 
in no way decisive. They suggest, if nothing more, that further work is needed on this 
problem, which is one both of theoretical and practical importance. It would be extremely 
interesting to know whether these discrepancies between observation and expectation could 
be attributed to the use of a deterministic, instead of a stochastic, model to represent the 
sampling chain, or to the fact that the expected numbers in the various classes were small; 
or, finally, whether they should be regarded merely as a chance occurrence of the somewhat 
rare type of event which must always be reckoned with as a possibility in a sampling 
experiment of this nature. 
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10. SumMaRY 


The problem considered in this paper is that of estimating the death-rate in a population 
which is continuously changing in numbers, from the results of a chain of successive samples 
at each of which a number of individuals are captured, marked and returned to the popula- 
tion (the capture-recapture method). In order to solve this problem in a comparatively 
straightforward fashion, certain simplifying assumptions must be made, of which all may 
not be realized in practice. In the particular case studied, the death-rate Q per unit of time, 
or alternatively the survival factor P = 1—Q, in the population as a whole, is assumed to 
remain constant over the period of sampling. 

At each resampling of the population the history of the marked individuals is found from 
the marks received at previous captures. Since the number of possible classes of previous 
markings increases very rapidly with the length of the sampling chain, it is convenient to 
adopt some method of grouping the observations, and the efficiency of various methods is 
discussed in relation to the total amount of information yielded by the data. It is shown that 
the method of grouping which consists in forming the distributions of marks observed among 
the recaptured animals, and which is termed here Method A, results in a loss of information. 
Nevertheless, this method may still be the most convenient when the proportion of recaptures 
in the samples is low and a large amount of data has to be analysed. Another method, termed 
here Method B, in which the recaptures are grouped according to the interval of time since 
they were last captured, was found to have certain advantages, of which the most important 
is that there is no loss of information. Compared with Method A, therefore, a more accurate 
estimate of the survival factor should be obtained from a given set of data grouped in this 
way. The maximum-likelihood equations are given for estimating the survival factor from 
the data grouped by these two methods, and numerical examples of the solutions are also 
given in an appendix. 

In order to see whether the comparative efficiency of these two methods of grouping would 
be realized in the type of data which might be met with in practice, a sampling experiment 
was carried out in the laboratory. The scale of the experiment and the type of population 
were suggested by the analysis of some field data. The artificial population, which initially 
consisted of 300 numbered counters, was assumed to be subject to a constant ‘death-rate’, 
and five samples, each of fifty counters, were taken at equidistant intervals of time. The 
whole chain of five samplings was repeated twenty times. 

The results of the main experiment were in close agreement with those expected 
theoretically from the maximum-likelihood equations for a deterministic model of the 
population. They are also used to show how the totai amount of information in regard to the 
death-rate increases very rapidly according to the number of samples taken within a limited 
period of time, a point which is of importance in relation to the cost and design of investiga- 
tions in the field. Using the data collected in this experiment, an attempt was made to 
investigate the effect on the variance between replicates of reducing the expected numbers 
in the various classes of recaptures to relatively small numbers. The results were inconclusive, 
but were sufficient to indicate that further work is needed on this question, which is one not 
only of theoretical, but also of practical interest. 


The expenses of this investigation were partly paid for by a grant from the Agricultural 
Research Council. 
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APPENDIX I 
Numerical examples of the solutions of the equations 

The iterative solutions of the maximum-likelihood equations for the two methods of grouping the data 

are illustrated here on the same numerical example (replicate M). 


Method A 


Writing the observed k,, in a form of triangular diagram, which is a convenient way of exhibiting the 
data, the equation to be solved is 


egg + 2k + Skog 
+ keyg + 2ky, 
+ Key 
8,P*R, 8,(2P?R,+P?R,) | 3,(3P?R,+2P?2R,+PR,) 
“Me [ aE +P?R, P?R,+P?R,+PR, P*R,+P?R,+PR,+R, 
=:@, 





It will be seen that the positive term in this equation is merely a function of the observations and can 
be calculated immediately. In order to obtain the quotients for multiplying the observed sums of marks 
(8;), we may proceed as follows. (The same type of calculation also occurs in the solution of the equation 
for Method B.) Choose some trial value of P and write down the required powers of P in a column, starting 


with the highest power. Opposite these enter the appropriate values of R; and form the products. The 
remaining steps may be set out as follows: 


S; S; 
P*R, PR, 
P*R, P*R,+P*R, P®R, 
PR, P*R,+P*R,+PR, 2P*R, + P?R, 


R, P®R,+P?R,+PR,+R, 3P°R,+2P?R,+PR, 


The entries in the first column are summed from above down to give the second or S, column. This is in , 
turn summed, again from the top, to give S,, the first term being entered one line lower down and so on. 
Each entry in the S, column is then divided by the appropriate entry in S,, giving the quotients by which 
the observed s; must be multiplied. For machine work it is not necessary to write down the products of 
the powers of P with the number of captures, provided the same number of decimal places is retained, 
since the first summation can be carried out directly on the machine. 
The following are the recorded results (k,,) for replicate M, together with the various totals which are I 
required in the calculations: : 






































: 0 1 2 3 4 
x 
0 9 6 10 5 
1 7 7 10 
2 15 10 
3 ° ll I 
8; ; ‘ 13 32 36 
R, 50 50 50 50 : 
Then S(k) = 64+2x104+3x5+74+2x 10410 = 78. 
Taking the trial value, P, = 0-8, we have 
S, S, S,/S, 8% 
Pi = 0-512 25-600 
Pj = 0-640 57-690 25-600 0-4444 13 
P, = 0-800 97-600 83-200 0-8525 32 


1 147-600 180-800 1-2249 36 
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} Then 13 x 0-4444 + 32 x 0-8525 + 36 x 1-2249 = 77-154, 


and subtracting this quantity from 78, the value of P,aL/éP, for P, = 0-8 is +0-846. 
Since this quantity is positive, the required value of P is greater than 0-8. Taking a ne « trial value 
data P, = 0-85, and repeating the summations, we have 


P,dL/éP, = 78— 81-263 = — 3-263. 
: the } Hence, for P, = 0-8 and P, = 0-85, 


Thus, by interpolation, we have as a first approximation, 


A 1-058 x 0 95 
= 0-8 +——__—__ — = 08]l; 
- ™ 4-897 
: and Ip = 1/V(P) =4-897/0-05 = 98; 
can whence the estimated variance of P is 0-01020, and the standard error s = + 0-101. 
arks If a more accurate solution is needed, the value P = 0-811 is then substituted, and so on. The method 
tion of obtaining a more accurate estimate of Ip, given the results for three trial values of P, is illustrated by 
ting Fisher (1950, §57-2). 
The 
Method B 


In order to obtain a form more suitable for the calculations, the equation for @L/@P = 0 in §5 is first 
.multiplied throughout by P and rewritten as 
Mo P?Ry Moq(2R2R,— P*m;) r Mos(3P3 Ry — 2P?m, — Pq) 


P®R,— P*m, ¥ P?R,—P?m,—Pm, =P? Ry— P?m, — Pm, — Moz 





5 in } ™,,F*R, My4(2P?R, — Pm) 
on. P?R, — Pm, P?R, —Pm,.— M43 
7: F PR 
ed, PR, — Mz 
8,P°R, 8,(2P°R, + P®u,) 8,(3P®R, + 2P2u, + Pug) 
= & = PR, +P, P*R,+P%,+Pu, * P®*R,+ Pu, +Puyt us” 


Looking at the three rows of the triangular form in which the left-hand side (L.H.s.) of the equation is 
written, it will be seen that the six quotients involving the powers of P may be obtained by three series of 
double summations in the same way as the quotients in the previous example. Similarly, the three 
quotients in the right-hand side (x.H.s.). The actual calculations are thus of the same form, apart from 
the question of signs, as in the case of Method A, but the whole process is lengthier. 

Taking the same replicate M as an example, we have the foliowing table of the numbers observed in 
the different classes (m,,), together with the number of unmarked individuals (u,) which will be required 
in the calculations: 



































: 0 1 2 3 4 
x 

0 9 6 7 2 
1 : 7 5 6 
2 15 7 
3 ; ll 
8, ° P 13 27 26 
u, 50 41 37 23 j 
R, 50 56 50 ; 
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Taking P, = 0-8 as the first trial value, we have the following summations for the three rows of the 
L.H.s. It will be seen from the equation that in each case the first summation column involves a series 
of subtractions, and the second summation a series of additions. 


S; S, S,/S; Met 

P? = 0-512 25-600 
P? = 0-640 19-840 25-600 1-2903 6 
P, = 0-800 15-040 45-440 3-0213 7 
1 8-040 60-480 7-5224 2 

P? = 0-640 32-000 
P, = 0-800 26-400 32-000 1-2121 5 
1 21-400 58-400 2-7290 6 

P, = 0-800 40-000 
25-000 40-000 1-6000 7 


Hence, by adding the cross products of the last two columns, the L.H.s. = 77-570. 
The calculations for the R.H.s. are similar, except that only additions are involved. 


S; S, S/S; 8; 

Pi = 0-512 25-600. 
P? = 0-640 51-840 25-600 0-4938 13 
P, = 0-800 81-440 77-440 0-9509 27 
1 104-440 158-880 1-5213 26 


and the R.H.Ss. = 71-647. 
We have, therefore, for P, = 0-8, P,A@L/éP, = + 5-923. (It is important to remember that in rearranging 
the original equation for @L/€P = 0, we multiplied throughout by P.) Since this quantity is positive, 


a larger value of P, say P, = 0°85, is tried. Carrying out the same series of summations, etc., we find 
P,@L/éP, = — 2-240. Thus, for P, = 0-8, and P, = 0-85, ~ 


oL oL 

— = +7404, —— = —2-635, 

aP, + 7:40 aP, 63 

7-404 x 0-05 
and for our first approximation, P = ealkgs aa = 0-837. 
10-039 
Also Ip=10-039/0-05 = 201, V(P)=0-004975 and s=+0-071 
APPENDIX II 


Corrections to allow for accidental deaths 


It has been assumed hitherto that the number of individuals captured at each sampling (C,) is equal to 
the number marked and returned to the population (R;). In practice, however, it occasionally happens 
that a certain number of accidental deaths occur owing to the method of capture employed. Thus, some 
animals may be found dead in the traps, or through some accident in handling are so damaged that it is 
necessary to destroy them. Again, a group of individuals may be removed from the population for some 
purpose or other. It may be necessary, therefore, to allow for such losses at each sampling in estimating 
the survival factor. 

In the case of Method B, the corrections which must be made in the equation given in Appendix I are 
extremely simple. The number of unmarked animals (u,) and the number of recaptures falling into the 
various classes (m,,: Lm = 8) will be based, naturally, on the total number of captures, dead and alive, 

x 


at time ¢t. Let D, be the number of accidental deaths and removals at time ¢, so that the number marked 
and returned alive to the population is R, = C,—D,. Then, if we write out the expected numbers in the 
population as a whole at each sampling, in a similar form to that given in § 4, it will be found that the 
maximum-likelihood equation for estimating P is the same as that given in Appendix I, except that the 
u,(t = 1, 2,3) in the R.u.s. are replaced by quantities which we shall define as 


Y= U—D,; 


namely, the number of unmarked individuals captured, both dead and alive, less the total number of 
accidental deaths and removals. Thus, to summarize the corrections: in the equation for Method B, base 








Replicate 
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the m,; and the s, on the numbers capiured, the R, on the numbers returned alive to the population, and 
finally replace all u, by y. 

In the case of Method A, the full corrections which should be made are more complicated. Thus, it is 
necessary to form each time the distribution of marks observed among the accidental deaths and removals. 
These values must be subtracted from the expected population numbers in the appropriate classes. at 
each sampling. The full equation which results is very similar in its general form to that for Method B, 
and thus the comparative simplicity of the original solution is lost. Owing to the loss of information which 
results from grouping the data by this method, it therefore seems simpler to adopt Method B in the first 
place and apply the corrections given above. Nevertheless, if the number of accidental deaths is not large, 
or the recaptures form only a small proportion of each catch, we may still group the recaptures by 
Method A and retain the numbers released alive (R,) in the equation. This is equivalent to assuming that 
the accidental deaths occur among the unmarked animals at each sampling. This procedure will probably 
be accurate enough for all ordinary purposes, provided the number of losses is small. 


APPENDIX ITI 
Detailed results of the sampling experiment 


The detailed results of the sampling experiment are given in Table 3, where the observed recaptures 
Toie...¢ at time ¢ are tabulated, for each of the twenty replicate chains A, B,C, ...,8,T, according to the 
n = 2*—1 possible classes of suffix 012 ...¢. The total number of counters in the population at t = 0, 1, 2, 3,4 
was 300, 252, 212, 178 and 150 respectively, and each sample consisted of fifty counters. Tie method of 
carrying out each chain of drawings is described in § 6. Only the total number of previously marked 
counters (s,) recovered at time ¢ is given in the table, since the number unmarked can easily be found by 
subtraction, u,= 50—<s,. The last two columns give the observed means and those expected from the 
deterministic model of the population. 
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1. INTRODUCTION 


The estimation of the total size of plant or animal populations is of great importance in 
a variety of biological problems, which may relate to population growth, ecological adapta- 
tion, genetic constitution, natural selection and evolution, and so on. Obvious practical 
consequences are the maintenance of human food supplies and the control of insect pests. 
For human communities and populations of sessile organisms procedures employing fixed 
sampling units are available, but for mobile populations other methods must be used. 

The basic technique, which seems to have been first used by Lincoln (1930) to estimate the 
total number of duck in North America, and is sometimes referred to as the ‘ Lincoln Index’, 
is as follows. One catches, marks and releases a certain number of animals taken at random 
from the population. A further random sample is caught and the proportion of marked 
animals noted. Then the total number of marked animals released divided by the proportion 
of marked animals in the sample captured can clearly be used as an estimate of the total 
population size. The same method was adopted independently by Jackson (1933), who used 
it for estimating the true density of tsetse flies. Jackson (1937, 1939) subsequently extended 
his treatment to allow for both birth- and death-rates, advocating his ‘negative’ and 
‘positive’ methods. Further work by Jackson (1940) took into account the effects of migration. 
Recapture methods have also been successfully applied to populations of Lepidoptera. 
Dowdeswell, Fisher & Ford (1940) caught, marked and released moths on several different 
days. On each day the marked insects were classified according to the day on which they 
had previously been released. Analysis of the data, which could be exhibited in a triangular 
array, or ‘trellis’ diagram, gave estimates of the size of the moth population for each day. 
The interpretation of trellis diagrams was further developed by Fisher & Ford (1947). They 
estimated daily numbers, taking into account a death-rate which was obtained from the 
average time interval between marking and recapture. A more detailed application of this 
procedure was made by Dowdeswell et al. (1949). 

Although in the various papers cited above effective use has been made of estimates of 
population size and of birth- and death-rates, there has been little discussion for the most 
part of the precision of the results obtained. Jackson (1937, 1939) obtained both the popula- 
tion estimate and its variance for the ‘negative’ and ‘positive’ methods by fitting a curve 
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to a series of ‘standardized’ recapture rates. However, this was not really adequate, as it 
did not employ satisfactory weighting factors. Schnabel (1938) considered the problem of 
finding the total number of fish in a lake, and obtained the maximum-likelihood estimate of 
the population size for repeated sampling of the recapture type. Although approximations 
to the solution of the likelihood equations were given, there was no discussion of the precision 
of the results. De Lury (1947) has used a somewhat different approach to the same problem. 
Specimens caught are removed from the population, and it is assumed that catches are 
sufficiently large to produce an appreciably diminishing catch per unit effort with repeated 
catching. An appendix to his paper gives a full discussion of the appropriate statistical model. 

It is the purpose of the present paper to consider the precision of estimates of population 
size, and of birth- and death-rates derived from recapture data. We shall deal first with simple 
recapture, considering both direct and inverse sampling, and then go on to a new discussion 
of Jackson’s ‘negative’ and ‘positive’ methods. Although, in general, the simultaneous 
estimation of population size and birth- and death-rates from a large trellis diagram is 
exceedingly tedious, it will be shown that in at least one special case, that of ‘triple capture’ 
relating to catching on only three occasions, a satisfactory explicit solution is available. In 
certain circumstances, therefore, it might be worth while making a special attempt to collect 
data in this form. 

We shall, in the present discussion, assume that we are dealing with closed regions, 
unaffected by migration. Further, when birth- and death-rates are involved, it will be 
assumed that the numbers of marked animals are sufficiently large for these rates to operate 
deterministically. 


2. SIMPLE RECAPTURE 
(a) Direct sampling 
Suppose that we capture, mark and release a animals out of a total population of size z. 
When the marked animals have freely mingled with the unmarked, we catch a random 
sample of size n, of which r are found to be marked. We shall assume that 7 is sufficiently 
small compared with x for us to be able to ignore the complications of sampling without 


replacement. The large sample theory of the maximum-likelihood approach will be appro- 
priate if we envisage n, a and z all tending to infinity while keeping constant ratios to each 


other. The likelihood is P 
Lt. ("\ (4 x—a\"* : 
ear) - i 


Therefore L = const. + (n—1) log (x —a)—nloga. (2:2) 


Although z is in fact discrete, it will be convenient to treat it as though it were a continuous 
parameter. Differentiating with respect to x gives 





= 2-55 
ox x—-a 2 (2:3) 
Hence the maximum-likelihood estimate of x is 
2 = an/r. (2-4) 


Schnabel (1938) discussed the solution of the corresponding likelihood equation for a set of 
independent les, e.g. 
independent samples, e.g p» (n,—1,)|(a—a,) = Em2, 
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where the suffix 7 indicates the values of n, r and a appropriate to the ith sample. She did not, 
however, go on to examine the precision of the estimate thus obtained. 
Differentiating (2-3) with respect to x, and then taking the expectation provides the 
required information in the case of a single sample. Thus 
OL 


IL=-E —* an|{a?(a —a)}. (2-5) 


Using the value of x given by (2-4) we can then write 
var# = IZ} = a? n(n—1)/r°. (2-6) 


Now @ has an infinite expectation if we admit the value r = 0. On the other hand, this value 
will occur so rarely in large samples, for which m = Fr is not small, that we can choose to 
exclude it. With this convention it can be shown that the relative bias in the expectation of 
Zis of order m-'. For, adopting the technique employed by Anscombe (1948), we can expand 
the right-hand side of (2-4) as a Taylor series in powers of t = r—m. Taking expectations of 
both sides then leads to an asymptotic formula for E2 (r +0). We find 


Banx(l +==*+0(m-*)) = x(1+5+0(m-)), (2-7) 


if m is small compared with n. 

Suppose x = 1000, and x = a = 100. Then m = 10, and although the sample size is as large 
as 100, there is an average relative bias of the order of 10% in the maximum-likelihood 
estimate of x. A slightly adjusted estimate which suggests itself is 


& = a(n+1)/(r+1). (2-8) 


The expectation of % can be found exactly. We have 
~ ey An+1) (n\ (a\" (x—a\"™ 
Be = 3 ry) G) Ce) 
mn (n+1)\ (a\"*1 (x—a\"" 
ECG Gy 


w~—a 


n+1 
Therefore Hi =x (1 - (=) \. (2-9) 








_g\ ntl 
For large n (=) ~em, (2-10) 


which is quite small even for moderate m. For the values given above, the relative bias is 
now less than 2-5 x 10-5. A satisfactory expression for the variance of % is somewhat more 
difficult to find. However, using the expansion technique referred to above we can derive 


the asymptotic series 3 2 et 
pe POL (142 4...), (2-11) 





m2 
where p = a/x. We now obtain the variance from the relation 


var% = EX? — (Hz). (2-12) 
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We could obtain a single-series expression for the variance by substituting (2-11) in (2-12), 
and writing P 

. 1)? 2 
(EZ)? ~ 2? = a®n?/m? = (1 —-+ my . 

m n 

but as this would involve the consideration of inverse powers of n, there seems little to be 
gained by this device. 

Fortunately, there exists a nearly unbiased estimate of the variance of %, which is quite 
convenient for use with samples that are not too small. 

It is easy to show that 











a(a+ 1)(n+2) | v *=)"- | (*) = ' 
E (+1) (r-+2) = 77/1 - (n + 2) 3 ( _ (2-13) 
~2*(l1—me-™), for large n. (2-14) 
Therefore, if we write 
_ (n+? _ an +1) (n+2) _ aX(n+1) (n=) men 
(r+1)®  (r+1)(r+2) — (r+1)2(r +2) ’ 
then ET = E# —2?(1—me™) ~ a2 + 2*me™, (2-16) 


neglecting quantities of order e~”. Now we know from (2-5) that a7 is of order x*/m, so that 
ET ~o3(1+m?e-™), (2-17) 


The relative bias of T in our previous example is thus of order 2-5 x 10-%, which should be 
quite satisfactory for practical purposes. 

It is worth mentioning here that in certain ecological problems we may be more concerned 
to use the reciprocal of the population size, 1/2, as the appropriate index, rather than the 
population size itself. It follows from (2-4) that the maximum-likelihood estimate of this 
reciprocal is r/an. Consideration of the binomial distribution in (2-1) shows that this estimate 
is unbiased and has variance (x —a)/(anz?). 


(b) Inverse sampling 


Now in the above treatment we studied the mean and variance of estimates of the popula- 
tion size x in relation to repeated sampling for which the total size of the samples drawn was 
regarded as fixed. Various writers (e.g. Haldane, 1945) have considered the method of inverse 
sampling, according to which random sampling is continued until a certain predetermined 
number of individuals with a particular attribute has been obtained. The use of inverse 
sampling with simple recapture results in an appreciable simplification—an unbiased 
estimate of population size and an exact value of the sampling variance are available, even 
in the most general case of sampling without replacement. It should, of course, be mentioned 
in passing that it may not always be possible to adopt this method of sampling in practice. 

Suppose we catch, mark and release a animals in a total population of size x. We then 
draw a random sample by the inverse method, continuing our catching until we have 
m marked animals altogether. The total number of animals in the sample is n, which is now 
the random variable under consideration. 

The probability of obtaining a sample of size n is the probability of drawing first a sample 
of size (n — 1) containing (m — 1) marked animals, followed by the drawing of just one further 


a 





); 


SS 
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marked animal. Having regard to the existence of sampling without replacement, the 


likelihood is clearly a li 
P(n) = m—1} \n—m] (a—m+1) 


Re (x—n+1) 


=2(¢-4) (2-2) /@-}). (219) 


where O<m<a and m<n<2x+m—-a. 





(2-18) 


It follows from (2-19) that 


ECC) ey Ene) om 
m—1 


The expectations of n and (n+ 1), which we shall have occasion to use below, are readily 


derived as follows: oy ae, a’ 
Ba En) ae” ee 
a—1\*tm—-4 (x—a)\ | (x 
-a(7) nan rl: aie 


The summation on the right-hand side of (2-21) is obtained immediately from (2-20) on 
writing x+1,a+1,+1 and m+1 for a, a, n and m respectively. Therefore 


En=a("~\) @+1)[e+1)(%) -= (2-22) 
Similarly, En(n+1) = a ta 1) Coat i) 
miear ar MR 0) | OC 


To evaluate the summation on the right-hand side of (2-23) we write 7+ 2,a+2, +2 and 
m+ 2 for x, a, n and m in equation (2-20). Therefore 





En(n+1) = (@e+1)a(%— 4) (+2) [(a+2) eer) _ m(m-+1)(%+1)(e+2) (2-24) 


m+1 (a+ 1)(a+2) 


Now, strictly regarded, x can take only integral values. Thus the greatest value attained 
by the likelihood is derived by considering the ratio 


P(n|x+1)_— (w—a+1)(4—n+1) 
P(n|x) ~~ (+1)(ez—a—n+m+1)’ 





(2-25) 


using (2-19). It follows from (2-25) that 


P(n|x+1)2P(n|2), 


(2-26) 
according as xs — =f, 
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Unless an/m is an integer, the likelihood attains its greatest value when x is equal to the 
integral part of an/m. If an/m is integral the greatest value is attained at both (= ~ 1) 
and —. 
m 
In practice z is usually sufficientiy large for us to be able to ignore its discreteness. In any 
case we are led to consider the estimates of the general type an/m. It follows from (2-22) that 





of ae (227) 
™m 


is an unbiased estimate of x, for any values of x, n, a and m whatever. In large samples 
x’ approximates to the maximum-likelihood solution. The exact small sample variance of 
x’ is also easily evaluated, using the results in (2-22) and (2-24). We find 


(a—m-+1)(x+1) (x-a) 


WEF m(a+ 2) 





(2-28) 


3. JACKSON’S ‘NEGATIVE’ AND ‘POSITIVE’ METHODS 

(a) The ‘negative’ method 
The ‘negative’ method was first described by Jackson (1937), and simply consists in the 
repeated catching, marking and releasing of animals on successive occasions, making no 
record of the number of recaptures until the last occasion. Under certain circumstances, when 
the preliminary marking can be carried out by relatively unskilled workers while the final 
scoring of the numbers of recaptures requires a more elaborate set-up, this procedure may 
be very suitable. 

_ As mentioned in the introduction above, Jackson’s method of fitting a curve to 
‘standardized’ recapture rates is not really satisfactory owing to the lack of proper weighting 
factors. The present section will develop the maximum likelihood approach. 

Suppose that on the jth day previous to the last day a, freshly marked animals are released. 
Then, if we can assume a constant death-rate y, a,;e~”) of these animals will survive to stand 
a chance of recapture on the iast day. Of a total of n captures on the last day, let r; have been 
marked on the jth day previously and let ry be unmarked. With animals bearing more than 
one mark it will be convenient to take notice of the earliest mark only, although this will 
result in the loss of a slight amount of information. Then if x is the size of the population on 
the last day, we can write the likelihood as 








i ~yi\r 
a,e-“\"5 x ue 0 
ec [] eS ‘ 
j=1 z x 


(3-1) 

Therefore L = const. —nlog x — Y DI +ry log (x— Pat ev), (3-2) 
Let us write F(y) = % a,e-, 

F'(y) =- 2% e”, | (3°3) 


F'y)= Y jase”. 
j=1 








the 
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PP =A, (3-4) 


Further, put 


Then the likelihood equations are 
OL n To 











Ox “~2'z-FQ) ° (35) 
oL roF'(y) 
and x =—-A- = 0, 3-6 
By z—F(y) ries 
We can eliminate x from (3-5) and (3-6) to give 
x c;e-7 = 0, (3-7) 
j=1 
i). 
where C; = a,(j ae -) ’ (3-8) 


The quantities c; are readily determined from the observations, and the required root, ?, 
of (3-7) is easily obtained in practice by the usual iterative procedure. Substituting this 


value of y in (3-5) then gives 2 =nF(9)\(n—1). - (3-9) 
Further differentiation of (3-5) and (3-6), followed by taking expectations, gives the amounts 
of information I, = nF {a(x —F)}, 
I, = — nF" |{x(x — F)}, (3-19) 
L,, = nf F"? + F"(x—F)}| {a(x —F)}, 
where F, F’ and F” are calculated for y = ). It now follows from (3-10) that 


2 FF’ 4 
var? = lap}, (3-11) 
and var) = «F/{n(FF’ —F’)}. (3-12) 


It is, of course, important to realize that the foregoing treatment assumes that the death- 
rate operates ‘deterministically’. This will be satisfactory if the numbers of marked animals 
involved are sufficiently large; otherwise, ‘stochastic’ fluctuations may become important, 
and it is possible that, under such conditions, the variances of the estimates would be larger. 

Example. Jackson (1939, p. 241) gives some data for the ‘negative’ method carried out 
on a tsetse fly population. In our notation we have: 




















j 6 5 4 3 2 1 0 
a, 1262 1086 1299 1401 1198 1183 — 
r 1 5 17 28 48 70 1389 




















The total number, n, captured on the last day (j = 0) was 1558. It is not clear what con- 

vention was adopted with regard to flies marked more than once, but for the purpose of this 

illustration we will assume that on the last day notice was taken of the first mark only. 
Biometrika 38 20 
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The quantity A in (3-4) is 349, and the six values of c; are easily found to be 
c, = — 1260-00, c, = 2513-45, 
Co=— 77:98, ¢; = 3187-31, 
C,= 1309-81, c. = 4965-86. 


We now have to solve (3-7). Writing e~” = yw we take the trial values ~ = 0-55 and 0-54; the 
right-hand side of (3-7) is then + 29-193 and — 13-692 respectively. Interpolating gives 


fi = 0-543, 
> =—log, f= dain | 
The next stage is to compute the F’s from (3-3). We find 
F(?)= 1416-444, 
F'(}) = — 2923-873, 
F’(9) = 8327-083. 


We are now in a position to substitute in (3-9), (3-11) and (3-12), obtaining finally the 
following estimates with attached standard errors: 


# = 13,060 + 1890, 


Y= 0-611+0-060. 


(b) The ‘positive’ method 

The ‘positive’ method was also first described by Jackson (1937). This consists in marking 
a large number of animals on a single occasion. On each of a number of later occasions the 
proportion of marked animals is recorded. The initial population can then be estimated from 
these data, taking the birth-rate into account. Jackson again used the method of fitting 
a curve to ‘standardized’ recapture rates, as in the ‘negative’ method, and the same 
objection applies. 

The analysis of this type of data is of necessity more complicated as it entails sampling 
on a number of occasions, whereas in the ‘negative’ method sampling variation occurs only 
on the last day. We shall find that the maximum-likelihood treatment is more involved, but 
is fairly readily amenable to the usual method of calculating scores and information functions. 

In order to render the mathematical treatment as simple as possible it is desirable that 
nearly all the animals captured on each occasion be released again, otherwise, unless numbers 
are very large, it would be necessary to take into account the depletion of marked animals 
by the process of recapture. 

If we introduce a death-rate in this case we shall find that the corresponding factors in the 
likelihood cancel, as we should expect. It is, however, necessary to consider a birth-rate, /. 
Thus after j days the initial population of size x will have become z e4/. Suppose that, initially, 


we release a marked animals, and that j days later we catch n,; animals, r; of which are 
marked. It is easy to see that 


L = const.—logx ¥ nj—~ ¥ jnj+ ¥ (n;—71;) log (xe4/ —a). (3-13) 
j=1 j=1 j=1 


ne 
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The likelihood scores are S(x) = of » Pe 53, Nj, (3-14) 
oL i(n;—1; ‘ 
and (6) = = S wi 5-% jn,. (315) 


a = ‘ ° 
Tag = = Jnsl(weh —a),/ on 





Typ = ad jaja —a). 


Using (3-14), (3-15) and (3-16), we can follow the normal method of scoring. Thus if @ is 
a column vector representing approximate values of 2 and B, if S is the corresponding vector 
of scores, and if J is the information matrix, then improved Haar emereag sig to the maximum- 
likelihood estimates are given by 0,, where 


6,=6+I- 8. 


As with the death-rate in the previous section, so here we have assumed that numbers 
are sufficiently large for us to work with a deterministic birth-rate. With small numbers we 
should have to consider stochastic fluctuations. 


4. TRELLIS DIAGRAMS 


Jackson’s ‘negative’ and ‘positive’ methods, discussed in the previous section, are clearly 
special cases of a more general procedure in which all, or at least the majority, of the animals 
caught on a particular day are marked and released; at the same time all captured animals 
are carefully scored with respect to the days on which they were previously marked. As 
mentioned in the last section, it is convenient, when an animal has more than one marking, 
to take notice of the earliest mark only, although this must result in a slight loss of informa- 
tion. The record showing the numbers of captures, releases and recaptures can then be set 
out in a triangular array. Such arrays have been used with great effect by Dowdeswell et al. 
(1940, 1949) and Fisher & Ford (1947) in the analysis of Lepidoptera populations. An 
example of such an array for three days only is shown in § 5 of the present paper. Fisher 
& Ford have analysed these trellis diagrams, taking into account a death-rate estimated 
from the average time interval between release and recapture. Daily estimates of population 
size can then be made. While this method is extremely valuable in practice it is difficult to 
assess the precision of the results obtained. 

Let us introduce a general notation for these arrays. On the jth day let the total catch 
be n,;, of which n,, were first marked on the ith day; while nj) are unmarked. Further, let 
8; freshly sninchiod axibaaail be released on the jth day. The total population on the jth day is 
to be represented by N;, of which Nj; were first marked on the ith day, and Nj) are unmarked. 
Then we can attempt a large sample theory using the method of maximum likelihood, by 
adopting the following approach. 
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If we can assume that the N; and Nj, are sufficiently large for us to ignore the effects of 
sampling without replacement, then the likelihood is evidently given by 


k j-1 
coc TL TT (Nx/N)% (4-1) 


where the trellis diagram extends over k days in all. If we can further assume oe existence 
of constant birth- and death-rates, we have 


Ny, = 8,e7-9 ~(§ =1,...,(7-—1)), 
N; =N, eb—v)G—-1) (j = 2,...,&), (4-2) 
j-1 
and Nip = Nj— TN; 

Fa 


It will be realized that while the whole population is subject to both birth and death, the 
group of marked individuals suffers depletion by death but gains new recruits only from the 
release of newly marked specimens. 

We can now <ubstitute (4-2) in (4-1) and proceed with the ordinary maximum-likelihood 
estimation of #, y and, say, N,. This is a laborious undertaking in general, and in any case is 
hardly applicable to data where the numbers of recaptures are low, as with much of the 
Lepidoptera material, for the reasons discussed in § 2 above, though it might be worth while 
with the fairly abundant data such as is often available with tsetse fly populations. 

However, this approach is susceptible of a fairly simple treatment if we restrict ourselves 
to catching on three occasions only. This will now be discussed in the following section. 


5. THE ‘TRIPLE-CATCH’ METHOD 


Suppose that we confine ourselves to catching on only three occasions. Then in the notation 
of § 4, the numbers caught, released and recaptured are as shown below: 





Let us assume, as in the two previous sections, that it is legitimate to work with deter- 
ministic birth- and death-rates, which are taken to be f and y respectively. Thus a group of 
N individuals gives rise to a group of size N e—” after time t. Note, however, that newly 
born or emerged individuals are always unmarked, so that the nwmber of marked individuals 
is subject only to the death-rate. Also suppose that only a few captured specimens are not 
subsequently released again, so that the numbers of animals marked on different days in 
the population are not appreciably affected. 


| 
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Finally, let the time intervals between days 1 and 2, ana between days 2 and 3, be t, and 
t, respectively. Then, in the notation of § 4, the expected numbers of marked and unmarked 
animals in the population are given by 


Ny, = 8,e7%, Ngo = 82 e~742, 
Ngo = Nye#-P4—s,6-74, Nyy = 6,e-M+00, 
(5-1) 
N, = Neh M4, Nyy = Nye -Mh+) — 9, e-Ma— 8, eh tty), 
N, = N, e4-Mts, 


The general maximum-likelihood treatment is simplified if we put 


N, = N, eh = x, 


ele = J, (5-2) 
e7vh = LM. 
The likelihood can now be written 
el oc —(N2tN3) (x — [s,) "2% (Ax — fs, a= 8_)"30 A-MsynrtNay), (5-3) 


The maximum-likelihood equations are therefore 








aL ee a. MgoA 0, 

Ox x L—ps, Ax—ps,—8, 

oL Ns Nyy X 

hr oe Oo a (>) 
OL _My+Ms, M081 30% a 

Ou bh x—-ps, Ax—ps,—8, 





It is worth observing that we are estimating three parameters, and have just three available 
degrees of freedom, one from the sample on day 2 and two from the sample on day 3. This 
situation has been discussed by Bailey (1951), who pointed out that the equations given by 
setting the expected numbers equal to their expectations provide maximum-likelihood 
solutions, and are often simpler to solve than the likelihood equations themselves. At the 
same time it is necessary to check that the likelihood equations, regarded as linear functions 
of the observations, are linearly independent. The appropriate test, that the Jacobian matrix 
of the expectations with respect to the parameters to be estimated shall have rank equal to 
the number of those parameters, is easily seen to be satisfied in the present case. Thus we 
can either solve (5-4) directly, or we can solve the alternative equations, which, using (5-1) 
and (5-2), are evidently 


N, 
21 
bre he, in Np [t8,/2, 
2 


N, 
Ns2 = y, ng = N38,/Ax, r (5-5) 





31 pom 
"7 = ns 
3 


N. 
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omitting the other two linearly dependent equations. From (5-5) we have, immediately, 
B = 8yNyNg1/Nq1 Ngo, 
ebts = X = ng, Ng/NyNz), (5-6) 
e*h = fi = 84Ng,/8, Ngo. 


The large sample variances of these estimates can be most easily obtained from Fisher’s 
formula for the variance of a function of the observations 


2 2 
varT = sm(57) -n(5 





? 
a= 


where 7’ is a function of the observations a;, for which the expectations are m,, the total 
sample size being n. In the present case there will be terms like n(07'/dn)? corresponding to 
both n, and n,. Thus we have 


var? = a(— + +2-2),] 


Ne, N32 Ng, 





var A = m(-- +2 -7-2), P (5-8) 
Ra My Ny 
Nzq Ny) * } 
The large sample variances of Band ) are therefore 
var B = var 4/(2a/0h)? = (= pape ---2)/4 
Me My My 
(5-9) 


var) = var /i/(0u/0y)? = (= +) / tt. 
N32 31 
The results given in (5-8) can be verified by the alternative procedure of calculating and 
inverting the information matrix, though this is rather more lengthy. 

Having regard to the results of § 2, it is evident that the above formulae require 7,1, %32 and 
Ng, all to be fairly large. However, estimates of the type shown in (2-8) will be approximately 
unbiased for much smaller values. There is therefore some advantage in using the adjusted 
estimates % 

& = 84(%q + 1) M31/(Mq + 1) (Mg + 1), 
els = X =14,(ng + 1)/Mg(ng1 + 1), (5-10) 
e-Vh = fi = 82/8, (M39 + 1). 


As in §2 there is some difficulty in calculating the population values of the variances of 
estimates given in (5-10), but again it is possible to obtain approximately unbiased estimates 
of the variances of %, A and ji. Consider, for example, the variance of % in (5-10). In a manner 
similar to that adopted in §2 for simple recapture, we have 

(mq + 1) (mg + 2) Mg1(Mg1 — 1) (mz + 1) (NM, +2) Ng1(Ng, — 1) NZ Ni, x 





(No, +1) (M3 + 2) (gp +1) (Ne +2) " (Noy +1) (Mo + 2) (M32 + 1) (gp +2) ™ Ni,Ni, 83 
(5-11) 
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Consider, therefore, the statistic 
83(Mq + 1) (My + 2) rg1(Mg; — 1) 


U = x- ae 
(Mgq + 1) (%qq + 2) (Mgq + 1) (M32 + 2) 





(5-12) 


Using (5-11) we see that EU ~ EX? —2? ~ vari. (5°13) 


Thus U is an approximately unbiased estimate of the variance of . By comparison with 
(2-17) it is evident that the relative errors introduced are of order m?e-”, where m is n., 
OF Ng. For X and ji, we have similarly 


V 2 _ Ma1(%e1 — 1) (ng + 1) (ng + 2) 








is 5:14 
Mig ~ 1) (Mg, + 1) (tay +2) ne 
where EV ~varA, (5-15) 
and the relative error is of order n3, exp (—3,). Also 
Ss 82 29,(Ng; — 1) 
Tah 2%31\%31 : 5-16 
Oleg + 1) (Mag +2) sit 
where EW ~ var ji, (5-17) 


and the relative error is of order n2, exp (— M9). 


6. SUMMARY 


The maximum-likelihood estimate of the population size in simple recapture is discussed 
and shown to have a relative bias of order m-!, where m is the expected number of recaptures 
(rejecting cases when there are no recaptures). This bias may be serious even if the total 
sample size is fairly large. A slight adjustment of the estimate gives a quantity whose relative 
bias is of order e~”, and is therefore satisfactory for much smaller values of m. It is difficult 
to find an adequate expression for the population value of the variance of this latter statistic. 
Fortunately, however, these exists an estimate of this variance, with relative bias of order 
m*e-™, which is sufficiently good for many practical purposes. 

It is also shown that with inverse sampling there is an appreciable simplification—an 
unbiased estimate of population size and an exact value of the sampling variance are 
available, and are valid for samples of any size in the most general case of sampling without 
replacement. 

Birth- and death-rates have been taken into account by Jackson (1937, 1939) as exemplified 
in his ‘negative’ and ‘positive’ methods. The present paper discusses the maximum- 
likelihood treatment of both methods, which does not seem to have been given previously, 
providing in particular properly weighted estimates of the precisions involved. 

The data collected by repeated marking and recapture on several occasions can be exhibited 
in the form of a triangular array or ‘trellis’ diagram. This was first done by Dowdeswell 
et al. (1939). A general maximum-likelihood treatment of such data is extremely laborious. 
If we restrict ourselves to collecting data on three occasions only—the ‘triple-catch’ method 
—then explicit solutions for the population size and birth- and death-rates are possible. 
As with simple recapture approximately unbiased estimates of these quantities and approxi- 
mately unbiased estimates of the corresponding variances are also available. In view of the 
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simplicity of these results it seems that, in certain circumstances, it might be advantageous 
to make a special effort to collect data in just this form—concentrating on three occasions 
only—rather than producing a large trellis diagram which may be difficult to interpret. 


I am indebted to Prof. R. A. Fisher for his interest in this paper and for many stimulating 
discussions. 
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A MATHEMATICAL THEORY OF ANIMAL TRAPPING 


By P. A. P. MORAN 
Institute of Statistics, University of Oxford 


Leslie & Davis (1939) have developed a mathematical theory for the estimation of the total 
population of an animal in a given region from the records of a series of trappings by simple 
traps (each of which killed the animal) under the assumptions that there is no immigration 
or emigration and that the births and deaths from natural causes during the period of 
trapping can be assumed small. The purpose of the present paper is to provide a theory which 
in some respects is preferable, and to discuss its assumptions and limitations. 

We suppose that there are N animals i. 1 definite region in which trapping takes place on 
k successive occasions. These will usually take place at equal intervals of time, although 
this is not essential. If possible the position of the traps should be chosen at random on each 
occasion, but if the average region occupied by each animal is large compared with the total 
area divided by the total number of traps, this is less important. On each occasion the same 
number of traps are used, and we denote the total catch on the ith occasion by C; (i = 1, ..., k). 
Unless the size N of the initial population is very large, the sequence of C’s will tend to 
diminish because of the exhaustion of the population, and from this decrease we wish to 
estimate N. If the above conditions hold, and especially if the traps are placed at random 
on each occasion, it is not unreasonable to suppose as a first approximation that the pro- 
bability of any particular animal being caught on a particular occasion is independent of the 
animal and the occasion (ways in which this assumption can break down are discussed later). 
Let this probability be g. Then the expected number caught on the first occasion will be Nq. 
Writing p = 1—q, the expected number remaining will be Vp, so that the expected number 
caught on the second occasion will be Npqg. Carrying on the argument we see that the expected 


values of C,, ..., C), are No, Nap, Nop’, ..., Nap. 
Then log E(C;,) = log N + (t— 1) log p+loggq, 


where E(C;,) is the expected number caught on the ith occasion. As this means that log H(C;) 
is a linear function of i, one method of dealing. with the problem would be to replace the 
E(C,) by the observed values C,, take logarithms, calculate a weighted regression line on i 
using an estimated standard error for each log C;, and then find the values of N and g from 
this regression line. This method not only requires a lot of calculation but is not strictly 
valid, since successive values of the observed C; are not statistically independent. For 
example, if C, is larger than Nq, the expected value of C,, given C,, is (N — C,) q, which is less 
than Nqp. Leslie & Davis’s method is somewhat similar to the above, but involves further 
complications and suffers from the same objections. 

The following method is based on maximum-likelihood estimation and is much shorter, 
as well as providing easily calculable estimates of the standard errors of estimation. We must 
now assume that the probability of any given animal being caught is independent of the 
number of other animals caught. This assumption would break down if the expected catch 
was nearly as large as the number of traps, since each trap can operate only once on 
each trapping occasion. However, in the example considered later and in most practical 
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circumstances the assumption is probably well verified. If, then, at any stage there are 
N animals in the population, the chance of catching m of them will be given by the term in 


the binomial] distribution 
m —m 
(7) 2 _ 


If on the first occasion C, are caught, the population at risk during the second trapping is 
N —C,, and the probability of catching C, wi!. »2 


Caan 


Writing r, = > C;, we therefore see that the joint probability of catches C,,C,, ..., C;, is 
i 





N N-r N! k 
CypN-C, _ CeyN-re — ti pkN-E ri, 1 
(c.)2 P ( o, }2 ? Gl...Gl(N—ry!? ? (1) 


and this is the likelihood of the observed catches. Writing L for the natural logarithm of this 
likelihood and using Stirling’s approximation in the form 


logn = (n+ 4) logn—n+ constant, 


we find L = constant + (NW + 4) log N— N —(N—r,+ 4) log (N —1,) 
k 
+(N —1,) +7, log q+ (iw- zr) log p. 
i 


To obtain estimators of N and q we maximize L for variations in N and q by differentiating 
partially and equating the resulting expressions to zero. We then obtain 


k 


ae ee 2 
oL (N-1 7 -1 f 
ay = MN -(N —14)-}—log {(N -17y)/N} + blog (1—g) = 0. (3) 


It will be found in most cases that the term }{N~-1! — (N —1r,)~1} is small enough to be neglected. 
Solving (1) for g we can therefore write (2) and (3) in the forms 


q = oa 7. 3 (4) 
kN- > 7; 
Tr a 
and log (1-%) = klog (1-4), (5) 


where W and q are estimators of N and q. Equation (5) is, in fact, equivalent to equating the 
observed total catch r, to its expected value. Equations (4) and (5) now have to be solved by 
a process of successive approximation. One way to do this is to make a guess at N, then 
calculate ¢ from (4), and then a new value of N, from (5). By doing this for several values 
of N and plotting N, —N itis easy to find the value at which they are approximately equal. 
One method of estimating a suitable value of N for starting this process is to guess from the 
data the value of i for which C; is about one-half of C,, and take twice the value of r; as a guess 


at N. The process of solving (4) and (5) is much quicker than the use of a regression line. 


g is 


(1) 
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The asymptotic optimum properties of maximum-likelihood estimators in the present 
problem do not follow from the usual theorems because the sample size is itself a random 
variable but, as in other similar problems, seem to remain true because, as we notice from (1), 


k 
r, and > r; are jointly sufficient for N and g. To obtain standard errors for our estimates 
1 


we shall therefore assume these properties to hold and the variance-covariance matrix of 
(#7, 9) is then 





Lb _ eb | 
0q? ON 0g 
eL en ]’ (8) 
~@Noqg ON? 
OL —8 —2 -1 —1 
where Ne = {(N —1r,)? -— NN} + N1-(N—1,)7}, (7) 
oL k 
dNog 1-q’ ©) 
k 
OL _ (e¥-2>) Tk (9 
ag (1-g g® 


In (7) the first term can usually be neglected. 

Leslie & Davis discuss an actual example. An area of 22-5 acres in Freetown, Sierra Leone, 
was trapped on eighteen successive occasions over a period of six weeks. 210 traps were set 
in houses on each occasion in a manner which was effectively at random. The number, C,, 
of Rattus rattus captured on each occasion 7 is given in Table 1. The greatest number of 
other animals caught on any one occasion was sixteen. These figures show that on no occasion 
was the total number of animals trapped greater than fifty-nine, so that the assumption of 
successive binomial distributions seems a reasonable one. 














Table 1 

Trapping C; r, E(C;) 
1 49 49 39-32 
2 32 81 36-35 
3 31 112 33-60 
4 34 146 31-06 
5 16 162 28-71 
6 33 195 26-54 
7 22 217 24-53 
~ 27 244 22-68 
9 17 261 20-96 
10 19 280 19-38 
11 18 298 17-91 
12 16 314 16-56 
13 18 332 15-31 
14 12 344 14:15 
15 14 358 13-08 
16 12 370 12-09 
17 17 387 11:17 
18 7 394 10-33 





























310 A mathematical theory of animal trapping 
From the figures of this table, the equations we have to solve are 


e 394 
IRIS eget eral 10 
’ 18N — 4150 a) 
and log (1-3) = 18log(1—@). (11) 


We have omitted the term ${N-1 — (N —r,)~1} from equation (11), as it can be shown to have 
only a small effect on the solution. We can then, for convenience, take the logarithms to the 
base 10. 

To solve (10) and (11) a method of successive approximation was used. Trial values 500 
and 600 were taken for N , P was found from (10) and a new value of N found from (11). The 
difference between the new value of N and the old was plotted graphically against the old 
value to give a better guess. A further approximation gave N = 520,q = 0-0756, which was 
considered sufficiently accurate. Inserting these values in (6) gave 


74530 Boss 0-00008707 02819 
19-47 0-006013, =| 0-2819 1079)’ 


from which it follows that 
s.z. (N) = 32-9, s.z. ) = 0-00933, 


and confidence limits for N and 7 were therefore judged to be 520 + 66 and 0-0756 + 0-0187 
respectively. Leslie & Davis in their paper found 485 +74, but their calculation involved 
finding an unweighted regression line. Mr P. H. Leslie has kindly communicated to me 
a recalculation of his data using the estimated weights given in his paper, and this gives 
N = 504 with a standard error of 36. 

The expected values of C;, using N = 520, gq = 0-0756, are also shown in Table 1, and 
comparison with the observed values gives a y? of 17-42 with sixteen degrees of freedom, since 
two constants have been fitted. Leslie & Davis also calculated a y? from their expected values 
and found 16-79, which is smaller than the above. However, their method involved estimating 
another constant derived from the observed number of sprung traps not capturing R. rattus. 
Moreover, maximum-likelihood and minimum ,? estimators, although both fully efficient, 
do not necessarily give the same result. The closeness of these values of x? to their expected 
values shows that the data are not inconsistent with the hypotheses of either theory. 

If the theoretical assumptions underlying the above theory are not true the conclusions 
may easily become quite incorrect. This may happen in a number of ways, of which the 
following four are perhaps the most important. 

First, if the animals become trap-shy as a result of trapping the theory breaks down and 
the true population size is then probably greater than that estimated. There is evidence that 
trap-shyness does occur in some animals, and in the present data there is perhaps some slight 
indication of it in the rather large drop in the second catch relative to the first. 

Secondly, if there exists a positive immigration into the area formula (1) becomes incorrect. 
The effect of immigration is to make E(C;) decrease to a constant positive value instead of 
to zero. If we attempt to take account of this in equation (1) by supposing that the population 
is increased by a constant number M, say, between each trapping, the estimation problem 
becomes very complicated because the factorials no longer cancel. 
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Thirdly, if the population density or liability to capture is so large that most or all of the 
traps catch animals on each trapping occasion, the theory breaks down. We can no longer 
assume the number caught to be distributed in the binomial distribution because it is 
bounded above by the number of traps. The effect on the observations is to make the initial 
catches nearly constant and equal or nearly equal to the number of traps for as long a period 
as is necessary to reduce the total population to such a level that the catch is small enough 
relative to the number of traps to make the binomial distribution a reasonable approximation 
to the truth. The setting up of a theory to take this into account might be possible but would 
not be simple. Leslie & Davis do make the number of traps enter into their theory, but not 
in such a way as to avoid this difficulty. They suggest that it might be overcome by setting 
a number of traps close together at each trapping point. 

The last and perhaps the most important reservation about the above theory is that it 
assumes that the chance of being trapped is the same for each animal. That this assumption 
is vital may be seen by imagining an animal population part of which satisfied the assumptions 
of the theory and part of which was completely trap-shy. The observations would behave 
exactly as if the assumptions were true, and the estimate obtained would be an estimate only 
of the number which were not trap-shy, the number of the rest being unknown. Moreover, 
no test on the observed figures would show the existence of the latter part of the population. 
In any actual population it would, of course, be more reasonable to assume that there was 
a distribution of probability of being caught among the individual animals. Unlike the two 
previous difficulties, it does not seem possible to set up any statistical method of verifying, 
from the observed catches alone, the correctness of the assumption of equiprobability of 
being caught. 

Finally, it s hould be pointed out that most biologists now use live trapping, marking 
release and recapture for the estimation of population. 


Iam very much indebted to Mr P. H. Leslie for his help and criticism during the preparation 
of this paper, and to Dr Dennis Chitty for some helpful criticism. 
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SOME SYSTEMATIC EXPERIMENTAL DESIGNS 


By D. R. COX 
Statistical Laboratory, University of Cambridge 


1. IntTRODUCTION 


Consider the following experiment. It is required to compare the effect in processing of 
a number of treatments applied to wool. The wool is divided into lots as alike as possible and 
the lots are numbered in random order. In week 1 lot 1 is processed with a certain treatment. 
In week 2 lot 2 is processed with, in general, a different treatment. And so on. The age of the 
wool affects its behaviour in processing so that, superimposed on any treatment differences, 
there will be a smooth trend due to aging. We want to estimate the treatment differences 
as simply and accurately as possible. In what order should the treatments be applied? 
More generally, consider an experiment to compare a number of treatments on plots 


arranged in order in space or time, one treatment being applied to each plot. We assume that 
the measured quantity is 


(Treatment constant) + (Smooth trend) +(Random quantity), (1) 


where the random quantity has zero mean, constant variance and is independent in different 
plots. We want to arrange the treatments to give simple and accurate estimates of the 
treatment differences. 

One method is to arrange the experiment in randomized blocks, thus eliminating part of 
the trend from the error of the treatment comparisons. 

A second method is to randomize completely the order of application of the treatments 
and to eliminate the trend by analysis of covariance on position. The calculations can be 
extensive if high-order trends are involved. Also when the above assumptions hold informa- 
tion is lost by the use of the method. The loss may be serious if the number of treatments 
and number of replicates are small (see § 4-2). 

A third and related method is to repeat the treatments always in the same order, e.g. 
T,T,T,T,T,T; ..., and to eliminate the trend by analysis of covariance on position (i.e. by 
fitting polynomials). Neyman (1929) discussed this in connexion with agricultural field 
trials and gave certain tables. The calculations are again lengthy, and to simplify them Hald 
(1948) has introduced generalized orthogonal polynomials the tabulation* of which would 
greatly reduce the labour of fitting the successive polynomials. 

The discussion of a recent paper by Miss G. M. Cox (Cox, 1950) contains some comments 
on the use of polynomial fitting in the analysis of agricultural field trials. In particular, 
R. A. Fisher referred to his and Prof. van Uven’s early work on the subject and criticized 
the method. 


R. M. Williams (1949) has worked on a related problem in which there is no trend, but the 
errors in adjacent plots are correlated. 


* Hald gives tables in some special cases. 
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2. OUTLINE OF METHOD 


Let there be ¢ treatments, n replicates of each treatment and VV = tn plots in all. Let the plots 
be equally spaced and suppose that the trend can be represented by a pth order polynomial 


Oy Ei t+... +0, E5, (2) 


where £7, is the orthogonal polynomial of order for N equally spaced points given in Fisher 
& Yates’s tables (1948). To begin with the value of p is assumed known a priori. The popula- 
tion mean of the measurement on the jth plot is 


Pp 
a,t+ D &,£;, (3) 
#=1 


where a, depends only on the treatment T; applied to the jth plot. 

The estimation of the a; is easiest if the treatment differences are completely orthogonal to 
the trend. Now treatment differences are orthogonal to the linear part of the trend if and 
only if ’ ’ ° 

y s,=D,£,=0 (§=1,...,0), (4) 
where 2; denotes summation over all plots receiving treatment 7;. In general treatment 
differences are orthogonal to the complete trend if and only if 


8,=2,6,=90 (¢=1,...,t;4¢=1,...,p). (5) 


If (5) is true we may estimate the treatment differences as if the trend were absent. The 
variance of a treatment comparison is 20?/n, where o? is the error variance. It is shown in 
§4 that if (5) is not true the variance is in general greater than 20?/n. 

Therefore the problem is solved if we can arrange the treatments to satisfy (5) or, if this is 
impossible, to satisfy (5) as nearly as possible. 

We shall say that an arrangement of treatments such that (5) is true forms a design 
orthogonal to trends of order p. 


3. SOME EXACT SOLUTIONS 


When there are only two treatments and the trend can be represented by a cubic, some exact 
solutions are possible. The total number of plots, N = 2n, is even so that all the £; are odd 
integers. Now the sum of an odd number of odd integers cannot be zero, so that it is impossible 
to satisfy (4) for odd n. Therefore the number of.replicates, », must be even and equal to 
2m, say. Then &; takes the form 


=S =% +3} 1 g & 


There exist symmetrical designs in which if treatment T; is applied in position £; = r (called 
position r for short), the same treatment 7; is applied in position —r. 


Example. t = 2,n=4, N =8: 
—7 -6 -$ -1) 1.3.8, 7 
© 2 2 (45 8 © 


Since the odd-order orthogonal polynomials £5,,, (@ = 0,1,...) are odd functions, all 
symmetrical designs have treatment differences orthogonal to £5,,,. 
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The problem is therefore solved if we can find m positions from 1, 3, 5, ... to take treatment 
T,, such that Sie = 0, (6) 
where &; denotes summation over those positions in 1, 3, 5,... which receive treatment T;. 
There are no solutions for odd m, but there are the following solutions for even m: 
Eight replicates. t = 2,m = 4,n = 8, N = 16, p=3: 
G iopsg.e--ew ae aes 
(48 {SSRs 
(and the design formed by interchanging 7, and 7;). 
Twelve replicates. t = 2,m = 6,n = 12, N = 24, p = 3: 


fi Ry By eT. ST BD G....17 19, ..21 23 
Bh hh Hh Hh h-hh gs 
, 2 2S S'S Ska as 
eRe Se ee ee 
a a a ae a oe a ee a 
LEE engenaggy,sh sh 
RA RREKR ERE RRERS 


(and the designs formed by interchanging 7; and 7}). 

Of these the second is recommended for practical use because it is most nearly orthogonal 
to &%. 

There are probably solutions for all higher even m. 

There are also non-symmetric designs orthogonal to £; and £3. Examples are: 


Four replicates. t = 2,m=2,n=4,N =8,p=2: 
Se Se SF SF oe eee 
h%m%%|EAEhT YT, 
Siz replicates. t = 2,m = 3,n = 6, N = 12, p = 2: 
4 -ll -9 -7 -5 -3 -1 1 83 5 7 9 11 
1 nG,fnnrinrenrnanrnratnt f, 
Similar solutions exist for m = 4, etc., but are of no practical value, since it is better to use 


the above symmetric designs. 
To summarize, there are exact solutions for two treatments as follows: 


Cubic trend Quadratic trend 
n=8 n=4 
n=12 n = 6, etc. 

probably n = 4r 


A similar analysis for three and four treatments shows that there are no exact solutions 
for reasonably small n. Therefore it is necessary to consider designs which satisfy (5) only 
approximately. 
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4, GENERAL PROPERTIES OF APPROXIMATE SOLUTIONS* 
4:1. Solution by least squares 


When the treatment effects and the trend are not orthogonal, we set up least-squares 
equations for the estimates d,,...,4,, &,,...,%,. To do this it is convenient to introduce 
normalized orthogonal polynomials £,, such that 


1 =p), vi 


where the summation is over all plots. (The £, are simple multiples of the ¢’, tabulated in 
Pp 
Fisher & Yates.) We write the trend as ¥ «,¢,, and the sum of squares to be minimized is 
then oe 
t Pp 2 
p> di (x—a,- x aE) ° (8) 
i=1 z=1 
The least-squares equations are, after the use of (7), 
Le = b, (9) 


where L is the (¢+ p)th order square matrix 


nt, 6S (10) 
ss’; u) 


e is the column vector {4,, ...,4,, 2,, ...,%,} and b is the column vector 
{Z,%, :.., 2%, DE, x, ..., UE, a}. 


In (10) I; is the jth order unit matrix, S is the t x p matrix whose (i, “)th element is 2; £,,, and 
S’ is the transpose of S. (The general form of the equations can be seen from the special case 
(13) written out at length below.) 

The inverse of the matrix (10) gives the variances and covariances of the 4; and hence the 
variances of the treatment comparisons (4;—4,;). Before considering the general case it is 
best to examine two important special cases. 


4-2. Two treatments 


When there are only two treatments the matrix (10) simplifies considerably. Since 
XE, = 0, 2, 6,+ 226, = 0, so that 81, = — 8, = 8,, say. Thus 


n 0 Wee, eee 


0 n —8 we —8 


- Pp 
|L| = n?-2n > 8%. 


* §4-5 summarizes the main conclusions of § 4. 
Biometrika 38 21 
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Similarly, the cofactors determining var (4,), var (@,), cov (4,,@,) can be found and we get 


202 £2 i 
—&,) = ——fI~— ; ll 
var (4, —d,) n- (1 = | (11) 
Thus the minimum variance, achieved when all s, are zero, is the variance of the difference 
of two independent means. The efficiency £ is 


B=1-2 58 12 
= are he ( ) 


Example. Consider three designs for two treatments, four replicates and cubic trend 
(¢ = 2,n = 4, N = 8, p = 3). 
































Order Exact solution Best solution 
+4 & & selected at for quadratic orthogonal to 
random trend & and & 
a 7 on T, T, T, 
at 1 5 T, T, T, 
a" =¥ 7 T, T, T, 
3 =f 3 Ts Ts Ty 
1 —5 -3 f', | T, ig 
3 -3 =9 T. | T, T, 
5 1 —5 a T; T, 
1 7 7 T, T, T, 
De 4 4 0 0 
py HE 6 0 4 
pe 14 16 0 
8? 0-0952 0 0 
33 0-2143 0 0-0952 
83 0-7424 0-9697 0 
E 0-4740 0-5152 0-9524 























(For example s, = &,£,, 8? = (2X, ;)#/168.) 
From this example we conclude that 
(i) an order selected at random may be very inefficient; 
(ii) the exact solution for p = 2 is very inefficient for p = 3. 


4:3. Treatments orthogonal to all £,, except one 


Another important special case is obtained when the treatments are orthogonal to all 
£, except one, which we call £. The matrix S consists of zeros except in one column which 


we denote {s,, ..., 8}. It is worth writing out the least-squares equations involving the 4; in 
non-matrix form. They are 


na, +8,2 = Xa, 


: (13) 
na, +8,& = 2,2, 


8,4, +5,4,+...4+84,4 & = Déx. 





2) 


id 





3) 
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The matrix L is a simple bordered matrix and 
t 
|L| = n'—n'" ¥ 83. (14) 
i=1 


Similarly, the cofactor of the first element is 








so that var (4,) = 72 (15) 
{n! —wiys a| 
i=1 
Similarly, we can find var (4,) and cov (4,,4,) and we get 
t 
[en - nt-9(s8 +83+2> #) — 2n'-*s, 8, 
var (4, —d,) = o? C (16) 
n' —n'1 5) 8? 
i 
One important consequence is that 
P 20° 
var (4, —4,) > _ (17) 


with equality if and only if s, = s,. 

We can find the variance of any treatment comparison in the same way. One natural 
measure of the efficiency of the design is the average variance V over all possible comparisons 
of pairs. Any particular @; occurs in (t—1) of the comparisons and does not occur in 
(t— 1) (t—2)/2 of them. Therefore 











- 2 2n!-2 (-2 
F. —7 [an — ttt 1) + (t-1) (t- 2)) Est ze pee 
{nt —nt- Est] i>j 
1 
Now 8, = 0, so that 2 ¥ 3,8; = — s?. Thus 
sj 
v- 20? (n — (t—2)/(t- J 
n n— Xs? : 
and the average efficiency £ is 
vi ’ fy _ lt — 2) 2 e 
= {1—2Xs?/n} a “—a" : (18) 


This reduces to a special form of (12) in the special case ¢ = 2. 


4-4. General case 


Wenow return to the general case t + 2 and p + 1 (equation (10)). By a Laplace development 
of (10) on the first ¢ columns we get 


\L| wee oS (—1)'n'*(> | S® {3}, (19) 
k=1- 


where {p, t} is the smaller of p and ¢ and | S| is the determinant of a kth order minor of the 
matrix S, the summation being over all possible kth order minors, i.e. we can express | L | in 
terms of the sums of squares of all minors that can be formed from S. The cofactor of the 
first element of L can be expressed in exactly the same way in terms of the minors of the 


21-2 
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matrix S,,, formed by deleting the first row of S. This leads directly to a formula for var (4,). 
Similarly, cov (4,,4,) can be obtained in terms of sums of products of certain minors formed 
from S. 

Now in practice we are interested in designs which are nearly orthogonal, i.e. for which 
the elements of S are small. In this case, if p and ¢t are not too large, products of more than 
two of the elements of S can be neglected, and 

















|L|~ we ) 
nit—nt®? > 32.) 
i+1; 
var (4,) ~ a he , , (20) 
wis [on —ni-2 » (83, +83, + 281,89, + 22 fi) ; 
an fa, IL] eae 
Averaging over all comparisons of pairs, we get 
= 20%(  (t—2) a 1 |" 
ve n (t-SanE% . neuen : 
xs? (t—2) -1 
. . - a Up ae 2 
so that the efficiency E is E=|l - \{1 ST 2et,| ; (21) 


4-5. Summary of §4 


The conclusions of §4 about non-orthogonal designs can be summarized as follows: 

(i) Least squares equations (9) can be formed for the treatment and trend coefficients. 

(ii) From these equations the variance of any treatment comparison can be obtained. 

(iii) The solution is simple in the special case of two treatments (§4-2) or when the treat- 
ments are orthogonal to all relevant £7, except one (§ 4-3). 

(iv) The efficiency of the design can be measured in terms of the variance of the difference 
between two treatments averaged over all comparisons of pairs of treatments. The efficiency 
is given by (12) and (18) in the above two special cases. 

(v) The efficiency in the general case is given by a complicated expression which simplifies 
to (21) for designs that are nearly orthogonal. 

(vi) The greatest accuracy in estimating treatment differences is obtained for orthogonal 
designs. 


5. SOME APPROXIMATE SOLUTIONS 


In this section the application of the results of §4 is illustrated. Consider experiments with 
three and four treatments and suppose the trend can be represented by a cubic curve (p = 3). 
If we restrict ourselves to symmetric designs automatically orthogonal to £; and £3, the 
results of §4-3 apply. 

In recording symmetric designs, marked by vertical dotted lines, only the treatments for 
positive ¢; are shown. For example, the first design is in full: 


& —-ll -9 -7 -5 -3 -1; 13 579UN 
7’ 7AEOA ALAN A 
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In all cases further designs can be formed by interchanging 7,, 7,, .... The best designs are: 


Three treatments, four replicates (t = 3,n = 4, N = 12, p = 3): 
B4 l ie See aay Bee 
g | 35 -29 -17 1 25 55 Dé = 12,012 
‘eS & se ae et 
E16, = 40, Eaéy=—8, Baby = -32, 
Ds? = 2888, = 0-2238, 
Efficiency = 0-9712. 
Three treatments, six replicates (t = 3, n = 6, N = 18, p = 3): 


eg | 1 3 5 7 9 11 13 15 17 

& | —-40 -37 -31 -22 -10 5 23 44 68 

(i) ae Soo Rk G&A 7, 

ms & 4-2. 4 4:34.48 
(i) di) 


X46=-6 12, 

X62=—-6 —6, 

X3f= 12 —6. 
Efficiency = 0-9992, 


Design (ii) is recommended for practical use because it is the more nearly orthogonal to &;. 


Four treatments, four replicates (t = 4,n = 4, N = 16, p = 3): 
gi 1 3 Ta wee 
& : —21 -19 -15 -9 -1l 9 21 35 
oe oe. ee ee ee 
m2, = 28, 2,=4, 2,6,=-12, 2,2,=-—20, Efficiency = 0-9796. 
Four treatments, six replicates (t = 4,n = 6, N = 24, p = 3): 
ei l 3 5 7 > hf 8s nce 
4 ; —143 -137 -125 -107 -83 -—-53 -17 25 73 127 = 187 
ee Se ee ae a 
x,4,= 18, 2,=-18, 24é,=—-6, 2,é=6, Efficiency = 0-9999. 
The following designs are also useful: 
Three treatments, three replicates (t = 3,n = 3, N = 9, p=2): 
& —-4 -3 -2 —1 0 1 2 3 4 
& 2 7 -8 -19 -20 -17 -8 7 28 
@ 4 f fF, T, T, Ts 7, T% 7, 
O22 G fF, Y&R 
(a) 2, = —36, 2,é=18, 2,é, = 18. 
Efficiency = 0-8676. 


253 
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Exactly orthogonal to linear trend. Comparison of 7, with 7, orthogonal to quadratic 
trend. ; ; 
(6) 2,8,= 2, X12 = 0, 


Xf, = —3, Ue = 3, 
B= 1, Yyh=-3. 
Efficiency ~ 0-9584. 
It is hoped to give a fuller list of the useful solutions in a later paper. The method of 
construction is straightforward. For example, the design given for three treatments, six 
replicates and cubic trend is found as follows. Since the design is to be symmetric, treatment 


differences are automatically orthogonal to £; and £3 so that we need only consider £5. For 
the ‘positive’ positions we have the following values of £3: 


4 1 3 5 7 9 11 13 15 17 
4 —40 -37 -31 -22 -10 5 23 44 68 
(a) 4 & & &F 
(6) “4 | & £EEAAKh GT 
(c) T, T; Ts T, h§&EhaER F 
@) 6) ©) 
2,6 —-12 -6 12, 
X69 o «6 —6, 
X36 12 12 —6. 





The problem is to select from these nine positions three for each treatment so that the sum 
of the values of £5 over each treatment is as nearly zero as possible. A few minutes’ trial shows 
that an exact solution is impossible and that the three designs above are the nearest solutions. 
The efficiency by (18) depends on the sum of squares of the &; £3. Therefore (6) and (c) are the 
best solutions and by (18) their efficiency is 0-9992. 

The designs given above for p = 3 are the best symmetric solutions. It is possible that more 
efficient solutions could be found by considering non-symmetric designs. The analysis of the 
non-symmetric designs would, however, be a little more complicated and the designs given 
above are, for most practical purposes, completely efficient. 

A further useful property of some of the above designs is that certain treatment com- 
parisons can be made with full accuracy even though the design as a whole is not fully 
efficient. For example, in design (a) for three treatments and three replicates, the comparison 
(7, — 7;) is orthogonal to £, since Z,£ = Xf). Thus, for example, if 7,, 7,, T; are three levels 
of one treatment we can arrange that the linear part of ‘between treatments’ is fully accurate 
and orthogonal to the quadratic trend. 

Some of the designs (i = 3, n = 4; = 4,n = 4; t = 3, n = 3) turn out to be in blocks with 
one replicate of each treatment in each block, the arrangement of treatments within a block 
being systematic. Designs of this form were often advocated in the early literature of 
agricultural field trials. The present work shows that some of them are optimum provided 


that they are analysed correctly and that the assumptions on which the analysis is based are 
satisfied. 
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6. COMPARISON WITH OTHER DESIGNS 


The example in § 4-2 showed that the selection of a design at random and the elimination of 
the trend by least squares (analysis of covariance) may lead to considerable loss of efficiency 
in estimating the treatment differences. It is interesting to find the average efficiency over 
all possible designs. Suppose, first, that only linear trends are involved (p = 1), so that the 
analysis of § 4-3 holds. We have from (18) when (¢— 2) &s?/n(t— 1) is small compared with one, 





(22) 


where s; = &;£,. To average (22) over all designs we need the following lemma: 


Lemma. Let N = tn, where t,n are integers. Let x,,...,£) be N numbers such that Xx; = 0, 
Xa? = 1. Divide the x, at random into t blocks of n and let s,, ..., 8, be the sums of the x’s in the 
separate blocks. Then the mean value of s? +... +8? is (N —n)/(N —1). 

The result follows on enumerating all arrangements, or can be deduced from formulae on 
sampling from finite populations. 

Applying the Lemma to (22), we find that the average efficiency £ is given by 


a 1 
N-2 
= Wel" (23) 
In the same way the approximate result (21) gives in the general case 
_ (N-p-1l 
B= (“527). (24) 


Equations (23) and (24) are exactly true for two treatments. (These are average efficiencies: 
in unfavourable cases the efficiency may be appreciably less than this.) 

In the example given in §4-2,t = 2,N = 8,p = 3 and # = 4/7 = 0-5714. 

In general the gain in efficiency due to the use of systematic designs is only large when the 
order of polynomial is not too small compared with the number of plots. 

Another instructive comparison is with randomized blocks, ¢ plots per block. If there is 
a linear trend of amount bo per plot, then in the absence of treatment effects the measured 
quantity for the ¢ plots in any block can be written 


const. + €, + bo, const. + €, + 2ba, ..., const. + €,+ tbo, 


where €,, ..., 6, are independently distributed with mean zero and variance o*. Two plots are 
selected at random to carry treatments 7, and 7; and the average variance of the comparison 
of 7; with 7, is, for this block, 


6 [e,+ibo —e; —jba}?, (25) 
where (i,j) form a random sample without replacement from 1,...,¢. After some algebra 
(25) becomes 

t(t+1)b 
2 a atl ek en 
20 [1 + 12 “|, 
2 44 
and for n blocks the variance is a 1+ uty”) 0] J 





im 
giving an efficiency E= ( a ar | ; (26) 
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This shows, as would naturally be expected, that the steeper the trend and the larger the 
number of plots per block the less efficient is the randomized block design. These conclusions 
hold also for higher order trends. 

To sum up, we have compared three ways of arranging the experiment: 

(a) by a systematic design; 

(6) by complete randomization and the use of covariance analysis to eliminate the trend; 
and 

(c) by randomized blocks. 

We can nearly always find a systematic design of efficiency very near unity, i.e. for which 
the variance of a treatment comparison is only slightly greater than 20?/n. The second 
method gives an average efficiency (24) independently of the amount of the trend. The third 
method gives an efficiency (26) decreasing as the amount of the trend increases. 

For the analysis of the systematic designs to be valid we must be able to assume from our 
physical knowledge of the experiment-that the measured quantity is formed by the addition 
of polynomial trend, treatment effect and completely random variation. This is not necessary 
when the design is randomized. 





7. ANALYSIS OF RESULTS 


In this section we show briefly how the results from one of these experiments should be 
analysed. The assumptions on which the analysis is based are stated at the end of §6. The 
method of analysis is a standard form of the method of least squares (analysis of covariance). 

First, consider the exact designs for two treatments given in §3 of the paper. We can form 
an analysis of variance as follows. The treatment difference and sum of squares are obtained 
as if there was no trend. The trend coefficients and sums of squares are obtained as if there was 
no treatment difference. The residual sum of squares is obtained by subtraction: 











D.F 
Treatments 1 
Trend Pp 
Residual 2n—p—2 
Total 2n—1 














The residual mean square s? estimates the error variance o?, and the estimated standard 
error of the treatment difference is s(2/n)*. If the residual variation is normal the usual exact 
tests of significance hold. 

In other cases the analysis is not as simple because trend and treatment effects are not 
exactly orthogonal. We can set the calculations out in the form of analysis of covariance or 
in the form of least squares theory. We can illustrate the latter method on the particular 
case of ¢t treatments with cubic trend, the treatment differences being exactly orthogonal to £5 
and £3. We calculate sums of squares for ), £3, and £3 asif there were no treatment differences. 
We set up the least squares equations (13) for the joint estimation of the coefficient of £3 and 
treatment effects. (The equations are in a form suitable for easy solution by iteration.) We 
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thus get both the best estimates of the treatment effects and the sum of squares for fitting 
£, and treatment constants. We have the analysis of variance: 














8.8. D.F. 
Fitting £5 and & U, 2 
Fitting treatments and &} U; t 
Fitting & Uz 1 
Treatments U,—U, t-—1 
Residual 0,0, nt—t—3 
Total U, nt—1 

















The residual mean square estimates the error variance o?, and the standard error of any 
treatment comparisons can be estimated directly from equation (16). If the distribution 
of the residual is normal, exact F and t¢ tests can be made in the usual way. 


8. APPLICATION OF DESIGNS 


It has been assumed that the order of trend p is given a priori. This is not normally true in 
practice, so that it is usually necessary to select a design with good properties for the largest 
p likely to occur. This value of p will depend on the nature of the experimental material and 
on the number of plots. 

The solutions given above can be made the basis of more complicated designs. For example, 
the experiment may be repeated in a number of blocks, a trend being estimated from each 
block. 

In the discussion it has been assumed that the main object of the experiment is to investigate 
the treatment differences. In some applications, such as that mentioned at the beginning of 
this paper, the trend itself is of direct physical interest. The same type of argument as used 
in §4 shows that the above systematic designs give optimum estimation of the trend. 


SUMMARY 


Systematic experimental designs are given for use when a number of treatments are to be 
compared, one treatment being applied to each of a number of equally spaced plots. It is 
assumed that there is a smooth trend between plots and that the ‘error’ is independent in 
different plots. The designs enable the treatment effects (and the trend) to be estimated as 
simply and accurately as possible. 


Sincere thanks are due to Mr R. C. Palmer, Mr F. J. Anscombe and Dr J. Wishart for 
helpful discussions. 
REFERENCES 

Cox, G. M. (1950). Biometrics, 6, 317. 
Fisuer, R. A. & Yates, F. (1948). Statistical Tables for Biological, Agricultural and Medical Research. 

Oliver and Boyd. 
Hap, A. (1948). On the Decomposition of a Series of Observations. Copenhagen: Gads Varlag. 
Neyman, J. (1929). The Theoretical Basis of Different Methods of Testing Cereals. Part II. Warsaw: 

K. Buszezynski and Sons Ltd. 
Wi1ams, R. M. (1949). Unpublished Ph.D. Thesis, Cambridge University. 











[ 324 ] 


THE COMPARISON OF SEVERAL GROUPS OF OBSERVATIONS 
WHEN THE RATIOS OF THE POPULATION VARIANCES 
ARE UNKNOWN 


By G. S. JAMES 
University of Leeds 


1. INTRODUCTION 


B. L. Welch (1947) has described a method for comparing the means of two normal popula- 
tions, when the ratio of their variances is unknown. The same method can be used to compare 
two regression coefficients. In the present paper we derive by similar methods a test for the 
equality of k = (r+ 1) means, or regression coefficients. 

Let 2,...,%, be any quantities which are distributed independently and normally with 
means /4,,..., 4, and variances «,,...,%, respectively. Let a; (¢ = 1, ...,k) be estimates of the 
a, based on v; degrees of freedom, the a; being distributed independently of the x; and of each 


other in the forms ae; 
-l “ail ral MiGs) 5 (ems , 
TGr,) \ 2a, P\~ 90, ) *\2a,)° (1) 


(In the case of the comparison of means, the x; will denote the means of the groups, while 
the «, and a; will denote the quantities called o?/n,; and s?/n; in the usual notation. If we are 
comparing the slopes of several regression lines, the x; will denc’e the estimated slopes and 
the «; the variances of these estimates. The principle can obviously be extended to more 
complicated situations, such as testing whether several regression lines have a common 
point for a specified value of the independent variate.) 

If the «; were known a priori, we could test the hypothesis that ~, = ... = 4, by using the 
fact that, when it is true, 











Lu, (x; —Z)? = Lo, x} — (Lw;,x;)?/w (2) 


is distributed as y? on r = (k—1) degrees of freedom, where w,; = l/a;, w = Zw; and 
% = Lw;2z;,/w. That is to say, 


Pr (Lo; 27 — (20; %;)?/w < 26] = G,(), (3) 


where 2£ denotes a particular value of x? (such as the 5 % point), p = 4r = $(k—1) and 
@() = | * plete (4) 
ee’ Dip) Jo : 


When the «; are not known, but the v; are large, we may continue to use the test described 
in (2) or (3) asa ‘large-sample’ test, replacing w, by w; = 1/a; and w by w = Lw,. If the v; are 
not quite large, this procedure will not be very accurate, and a ‘Studentized’ test which takes 
into account the sampling variations of the a; is desirable. Following Welch, we shall attempt 
to find a series development of a function h(a, ...,a,; §)=h(a) of the variance estimates 
a, and of £ only, such that in place of (3) we shall have 


Pr [Zw,a} — (Zw42,)?/w < 2h(a)] = G,(é). (5) 











SS OS 
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It will be observed that on the left-hand side of the inequality in (5) we have replaced the 
true weights w; occurring in (2) and (3) by estimated weights w;. The expression could 
therefore be written as Xw;(x;—Z)*, where now denotes Lw,2,;/w, but we shall not use this 
form, as it is likely to conceal the fact that % depends on the w,. 


2. PRELIMINARY ANALYSIS 


From (5) we have fer [2w, 23 — (Zw, 2,)?/w < 2h(a) | a] Pr [da] = G,(£), (6) 


where the first expression denotes the probability of the relation indicated for fixed a; (and 
hence for fixed w;), and the second denotes the product of the probability differentials (1), 
the whole being integrated over the range 0 to oo of all the a;. Now, by Taylor’s theorem, we 
have the symbolic expansion 


Pr [2w, 2} — (Zw, %;)?/w < 2h(a) | a] = exp [X(a; —«,) 0,] Pr [20,23 — (Zw, %,)?/w < 2h(a)], (7) 


where, after expansion of the exponential, 0; denotes 0/0«,;. It should be carefully noted that 
the symbols w; and w are used in this work merely as abbreviations for certain functions of 
the a,;, so that the operators 0; apply to them just as they do to the a, appearing explicitly. It 
is also necessary to notice that the notation Pr[...] used in (7) does not explicitly mention 
the distribution of the x,, which itself depends on the «,; this distribution is not, of course, 
supposed to.be affected by the operators 0;, so that if we were using a fuller notation it might 
be necessary to use some device to distinguish between those «; which were to be differentiated 
and those which were not. Substituting (7) into (6) we obtain 


G,(€) = [ exp (2(a,—a,) 2 Pr [da] Br [Zeya — (Soyx,)#/0< 2H(a)] 








= @Pr (Zw, 23 — (Zw, 7;)?/w < 2h(a)], (8) 
— py 
where @=T1 [e~ (1 - ec “| ‘ 
i 
“i a20? (4 0802 1/a202\? ™ 
=1+4+2 Y; +5245 ( Y; )}+0w ). (9) 
This result follows on substituting for Pr [da] from (1), exactly as in Welch’s paper. 
Now let h(a) = ho(a) +h,(a) 


where h,(a) is of order v~*. The h,(a) will be determined successively in the sequel. Introducing 
a further symbolic expansion, equation (8) now becomes 
G,(€) = Oexp [h,(«) D] Pr [2o,23 — (Zw ;2;)?/w < 2ho(x)], (11) 


where, after expansion, D denotes differentiation with respect to the expression h(a) 
appearing on the right-hand side of the inequality—a function which will prove to be just &, 
as might be expected. Substituting for © and h,(«) from (9) and (10) we obtain, to the 
second order, 


ag) = [1495 (Et 5 (BSA) [ta + h,(a) D+ (ala) D+ 4Ha) DYIPrL..} 


(12) 














326 Comparison of several growps of observations 


Equating terms of successive orders, and using (3), we find 
ho(x) = &, (13) 
a2 o2 
[ mia) D+] Pr [2,23 — (Zw; %;)?/w < 2£] = 0, (14) 
i 


Ke D+ thi(a) DP+2 a (AE (xe) + Zh (x) 8; + hy (x) 02) D 


: = 45 (nS) ]erc. j=0, (18) 
where A{(a) = d%h,(a), ete. 


Itnow only remains to carry out the differentiations indicated in (14) and (15) (remembering 
that D = 0/0) in order to obtain h,(«),h,(~), and thence h,(a), h,(a). At the corresponding 
point in the problem of comparing two means, the 0; operators only had to operate on simple 
functions, but here they have to operate on Pr [Xw,x? — (Zw; %;)?/w < 2¢]. This expression is 
a k-ple integral, and the operation may be thought of as differentiation, with respect to the 
boundary only, of the integral of the probability density function of the x; throughout 
a region in k-space. The method adopted to evaluate these derivatives is to change the 
boundary slightly, expand the integral in powers of the quantities specifying this change, 
and obtain the derivatives by comparison with Taylor’s expansion. 





3. EVALUATION OF THE DERIVATIVES 
2 .\2 
We have Pr [(2w, 7? — (Zw; 2;)?/w < 2] = Pr [=z (2%) [zz < 2. (16) 
i i i 


Now replace a; by ~,;(1+6,) in (16); we then have 


yer 2 ota Page) [Parra 














8 1 
= [1+ Ba 6,8, +5 Bae, 2) +. . |Pe[ 2 -(2 =) [=<], (17) 
by Taylor’s theorem. But we also have 
2 
= (27)-#* Naz exp (- 122!) dz, (18) 
R m" 
where f ... dx is an abbreviation for J... f ...dx,...da, and R, denotes the region 
a X; ) / 1 
agi ray” Bagh vay) | Paqnsey<% a 
Making the transformation x; = (2a,)*y,;, (18) becomes 
J=1¥/ exp(—Zyi)dy, (20) 
R 
3 1 
R=>d yi -(& Yi ) / < 
1+é; a} (1 +<;) Al +e) +€;) 2 (21) 


Tracing back the origin of the quadratic form in (21), or directly, it is seen to be of rank 
r = k—1. Let its matrix be A, so that the inequality in (21) can be written as y’'Ay < é. We 
can now find an orthogonal transformation y = Lz to new variables z;, which is such that 


L’AL = diag (1+7,,...,1+4,, 0). (22) 











3) 


4) 


———EEe 
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(20) now becomes 


k r 
J= rie exp (- > #) dz = rel exp (- >» #) dz, (23) 
U(1+m)2i<é 1 (1+ m4) 24<E 1 


on integrating out z,; dz now denotes dz,...dz,. Now make the further transformation 
z, = (1+7,)-*u,, and write 





(l+9)7=1-¢, (¢=1,...,r). (24) 
Then J= moTi(1—£) | exp [—2(1—€;,) uz] du 
cusy<é 
A an Cr 
= TAGS Fi [aa te rep Eudu. (25) 
by, «= br= 0.91! ...5,! J supe 


The integral is of Dirichlet’s type and is easily evaluated by making a polar transformation, 
for example, yielding 





= ihe or (6, 3) ste" A (6 4) 5 : = 
om tr ani ) rt 2b; +4r—1 pt 
J =n-¥TI(1—;) uF me T(35, +47) - rl et dt 








@) b;- 
= (l- i wi z) ol —4) chi #4208) ; 
Be: & 
= a(o8 A)" 4,6) (26) 
where £ is an operator with the property H°G,(£) = G,,,(€). We also define A = H—1. Then, 
using (24), (26) becomes 
J= I (1—9;,A)~* (6), (27) 


a remarkably simple symbolic form, but one which is unfortunately not so simple when 
expressed in terms of the ¢;. 
Now, since L is orthogonal, (22) yields 


L’[(1+A)I—AA]L = diag (1—7,A,...,1—y9,A,1+A). (28) 
Taking determinants, and inserting the typical element of A by inspection of (21), we obtain 
(1+ A) II(1—9,A) = | (1+A)I-AA| 








A \ A 
= |(144-795) 00+ capa aTey aire) 
=| 7:5;5+9:9 | 
k 
=P; Pal + Zain), (29) 
where 6;; is the Kronecker delta and 

1+e,(1+A) “ A / 1 
a . = —___ / 5 —_—_., 30 
‘——“T4e, °° @aiite)i/ ~a(l+e) as 


Substituting for p; and q? in (29), we have 


el A 1 
arayifa-na) =f] [+2 asetieaasanl ane) 
3 8, Bi 1+e(1+A)]+ [2o,/[1+6(1+A)]+ 
na wri ite | “Ll Zos+e) | ° 





(32) 
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It is now necessary to expand this expression in powers of the ¢;, and let it operate on 
G(g) in accordance with (27). It is to be noted that 


AG,(£) = eter (33) 
where 9,(§) = G(é) = To? es, (34) 


Comparing the resulting expansion of J with Rion in (17), we finally obtain, after a good deal 
of algebra, 





l 8 
0,9; Pr[...]=5(1-“4)g,43(8), (35) 
2 w 
292 Pp crt w;\? 
ai; r[..J=— | —— (3£ + 1)9,4:(8), (36) 
a3 Pr [.. 1=5(1-2) (5H? + 2H +1)g,,,(é), (37) 
att Pr[.. 1=-7(1-2) (35H + 15H? + 9E +5)g,,,(é), (38) 
2 2 
aza?ee?Pr[...] = [ - yg (1-22 EH) (9E3 — 3E? —5E —1) 
~77 1 (278 + 3B2 4 B+ 1) 42% 4+} (45054 OR? + 7B +3) 
— St (a5 + 15B* +98 +5) Jy, aa) (i+). (39) 


It may be noticed that (39) does not reduce to (38) when i = j. 


4. COMPLETION OF SOLUTION 
Substituting (36) into (14) we obtain 


hy(q) = i(3 ag oe z;, ( ek (2) 


Differentiating (40) twice with respect to «;, and substituting these results, together with 


(35)-(39), into (15), we finally obtain, as the approximation of order —2 in the v,, the 
expression 


2h(a) = x? + 3(3X4+X2) = (1-2 y+ + | fe(Bxa+ x2) a *) (= ‘(-")') 


V; 








+ 3(3X%q+ Xa) [(8Ry3 — 1ORg. + 4Ro, — 6Ri, + 82. Ry, — 4Rj,) 

+ (2Ryg— 4Ryy + 2K, — 2A i, + 4Ryp Ry — 2Rj,) (X2—1) 

+ }(— Rip + 4Ryy Ry, — 2Ryy Ryo — 4Rj, + 4Ry Ryo — Rig) (3X4— 2X2 — 1)] 

+ (Rog — 3Ryg + 3Ry, — Ryo) (5X6 + 2X4 + X2) 

+ (Riz — 4Rgg + 6 Roy — 4Ro, + Rog) (35xXg + 15NX6 + 9X4 + 5X2) 

+ 7e( — 2Ryq + 4Ry, — Rog + 2Ryg Ryo — 4Ry Ryo + Rig) (9Xs — 3X6 — 5X4 — Xe) 
+ 4( — Roy + Ri,) (27X5 + 3X6 + Xa + Xo) 


+ (Rog — Ryg Ry) (45x + 9X6 + 7X4 + 3Xq)) + O(V-*). (41) 


ee 
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: P " 1 /(w;\' 
In this expression we write R,, = U= (2: P (42) 
Vi \w 


while in order to facilitate printing x., is written in place of y**/(k—1)(k+1)...(k+2s—8). 
If we only wish to go as far as the term of order — 1 in the v;, this result may conveniently 


be written e \2 
2h(a) = [1 + en 2s (1 -*4) +0-)]. (43) 


To summarize, then, to test the significance of the differences between 2,,...,7,, we 
calculate the statistic Zw,23 — (Zw,2,)%/w = x2, say. (44) 
If the v; are all large, this can be taken to have the x? distribution with (k— 1) degrees of 
freedom, so that if x? denotes the value appropriate to a given probability P, then 
Pr [x2 < x”] = P. However, if the v; are not all large, we must not compare x? with the 
percentage point x”, but with another statistic 2h(a), of which (41) is an expansion in inverse 
powers of the v;. 

The actual manner in which (41) ‘converges to the true function 2h(a)’ is not known, 
but it is hoped that the use of at any rate the first corrective term (incorporated in (43)) 
may result in a test which is a good deal more accurate than the large sample y? test, 
at any rate for moderately large v;. This is known to be so in the case k = 2 considered by 
Welch. (See, for example, Aspin (1948), table 1.) In any case, to include the second corrective 
term would involve a good deal of numerical calculation. 

It should, perhaps, be mentioned that the result (41) can be shown to agree with Welch’s 
result when k = 2. In this demonstration it has to be noted that certain non-linear relations 
hold between the quantities R,, in the special case k = 2. 


The author wishes to express his thanks to Mr A. M. Walker for much helpful discussion 
on different methods of dealing with problems of the kind considered in this paper. It is hoped 
to publish in a subsequent paper methods for dealing with more general linear hypotheses 
than the one considered here, and also with some multivariate problems. 
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Note added in proof. It should be pointed out that (5) is by no means the only possible 
way of modifying (3) to obtain a test for smaller samples. We could attempt to find 
— W, (a) = Ila, + ja (a) +--] 
such that Pr [2 W, a3 — (ZW, x,)?/W < 26]=G,,(£), 
the j;, being O(v~). It then transpires that the j;, have to satisfy the single equation 


A toa ee 


If we look for a solution of the form j;, («)=a,/(a), we are led to our original test (43). 
Another obvious solution is obtained by setting each term under the summation sign 
equal to zero. 
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ON THE COMPARISON OF SEVERAL MEAN VALUES: 
AN ALTERNATIVE APPROACH 


By B. L. WELCH 
University of Leeds 


1. INTRODUCTION 


Let y, (¢ = 1,2,..., 4) be statistical quantities normally and independently distributed with 
means /, and variances A,o?, respectively, where the A, are known constants but nothing is 
known about y, and 07. Suppose that the data provide estimates s? of the o? which are 
distributed, respectively, as y?0?/f, where f, is the number of degrees of freedom of 43. 
Suppose, furthermore, that all the s? are distributed independently of each other and of all 
the y,. The problem considered here is then to test whether the data are consistent with the 
hypothesis that all the y, are equal, i.e. w, = mw (t = 1, 2,...,k). 

As a particular case y, may be the means %, of samples of n, observations from k different 
normal populations whose true means and variances are 4, and o?, respectively. Since V(Z,) 
is o?/n, we have A, = 1/n,. The separate samples provide estimates s? of the o?. The hypothesis 
under review is then whether the populations sampled may be considered to have the same 
mean without, however, imposing the condition that their variances are necessarily equal. 

The problem has already been discussed by G. S. James (1951) elsewhere in the present 
journal (pp. 324-9). In my notation, which is slightly different from his, he considers the 
statistic Lw,(y,—9)*, where w, = 1/A,s? and 9 = (Xw,y,)/(Xw,). When all the , are equal this 
quantity is, in large samples (i.e. provided f, are large), distributed as x? with (& — 1) degrees of 
freedom. If x3 is the value of y? exceeded with probability P, we shall then have 


Pr [ZX wi(y—9)? > xb =P. (1) 


For samples not large enough for this to apply James considers the problem of finding 
@ function h(w,, w.,...,w,,P), which may be written h(w) for brevity, and which has the 


property that Pr b> w(y,—9)2 > h(w)] = P. (2) 


He develops a method of deriving successive approximations to such a function bringing 
in, in turn, terms of order 1/f,, 1/f? and so on. To order 1/f,, for instance, he finds 


iB 3xpt+(k+1) 51 be 
h(w) = xp | 14 HERETO 5 (1- se) |. “ 


James also gives explicitly the term of order 1/f? but, as he points out, this is so complicated 
as to discourage the view that it is likely to prove of much practical utility; nor does it 
appear to be worth while to carry through the heavy algebra needed to produce the terms 
of still higher orders. However, a procedure based only on the terms to order 1/f, is quite 
easy to apply and, provided the f, are not too small, is likely to be an appreciable improve- 
ment on the large-sample assumption that ymy—9? follows a x? distribution. It may 


therefore be of interest to derive the result given in (3) by an alternative route. This I propose 
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to do in the present paper. More specifically, I shall first develop an approximation which 
involves the use of the variance ratio tables rather than the x? tables. But it will be seen 
that, to the order 1/f,, this is equivalent to the procedure given by James. 


2. THE SPECIAL CASE, k = 2 


Before proceeding it will be convenient to recall certain results for the particular case k = 2 
which has already been discussed fairly fully by myself (Welch, 1947, 1949) and Aspin 
(1948, 1949). In this case we have 


: ave _ UU 2(Yi— Yo)? (Y1 — Ye)” 

Nw Se 1" 271 ~~ = a abt mm. BS (4 

mel 9) (w, + wy) (A, 83 + Ag 83)’ \ ) 
i.e. Swulu-9? =v, where v = (y,—Y2)//(Ay8}+A289)- 


Now when ys, =“, the numerator of v is a normal deviate with standard deviation 
/(A,o} +A, 03). The denominator of v is an estimate of this standard deviation obtained by 
replacing o? and o3 by the sample variances s? and s3. Welch and Aspin have made some 
progress towards tabling critical values of a quantity v defined in this way. At the same time, 
however, I have also given some consideration to an alternative approach which requires 
only the already available tables of the ‘Student’ distribution, and which furnishes an 
approximation which numerical work suggests to be very useful. This alternative approach 
stems from the replacement of the true distribution of (A,s%+A,s3) by a Pearson Type III 
curve with moments suitably adjusted. I was able to show (Welch, 1947) that the errors 
involved in making probability statements about v on the basis of this particular approxi- 
mation were of order 1/f?. 

The use of standard types of frequency curves to approximate to awkward sampling 
distributions is, of course, a common device. It is natural, therefore, to ask whether it has 
any useful application in the more general problem of making probability statements about 
pmu-9? when k may exceed 2. It is to this question that we now turn. 


3. THE MOMENT-GENERATING FUNCTION OF > w(y,;—9)? 
t 


We have seen in equation (4), above, that, when k = 2, }) w,(y,—9)® separates into a pure 
t 


function of the y, divided by a pure function of the s?. When k> 2 a like separation does 
not take place. It appears, therefore, that we have to approximate the distribution of 
¥ w{y,—9)? directly without being able to make it depend on a function of the s?, alone, 
t 


which has simple moments. Now the moments of } w,y,—9)* are not simple and, indeed, 

after a certain order they become infinite. Nevertheless, we shall proceed formally as if the 
moment-generating function, M(w), of ~ w(y,— 9)? existed, writing 

M(u) = Hexp[u > w(y,—9)"), (5) 

where E denotes averaging over the joint distributions of y, and s?7. This can also be written 

M(u) = E, E,exp[u ~ w(y,—9)?), (6) 
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where FH, denotes averaging over the joint distribution of the s? and H, denotes averaging 
over the joint distribution of the y,. 


It will be recalied that w, = 1/A,s?. Similarly, we can denote 1/A,o? by w,. With this 
notation, expanding symbolically, we have 


#,exp [wu Dwoly— 9)*] = Dexp[u Daly y)"), (7) 


where ® = E,exp ix (w,—«,) Dj}, (8) 


o - 
Dp= (5) and 9 = Bay) /(Ex). 
Before ® can be interpreted it must, of course, be expanded in powers of D,. This gives an 


infinite series in D,, but if we proceed only to a certain order of 1/f,, we have a series which 
terminates. Thus to the first order it is seen that 


= 142E et (9) 


t t 





In the present paper we shall work only to this order. Carrying through the differentiations 
we then obtain 


M exp [u > u(y,—Y)"] = exp [u ~ o(y,—¥)*] 





w(y—¥)?_ _2u eine a 
[14 any DE Soke Dow—Mlh--- | (00) 
where, now, 7 = (2 ol ( w,). We then see from equations (6) and (7) that, in order to 


obtain M(w), we must average the right-hand side of (10) over the joint distribution of the y,. 
Since we are at the moment only concerned with what happens when the y, are equal, 


and since (10) does not depend on the origin of co-ordinates, we lose no generality if we 
take uw, = 0, i.e. 
” P(Ya» Yas +++ Yu) = (2m) (Wy Wy... Wy) e-PHa, (11) 


We then have from (10) and (11) 
M(u) = fem (w, 0, ... o)texp[—} > oyitu Luln-Iy 


x[ 142m {1 24) yaar Doni [dnd (12) 
th Le, t 
Using again the symbolic expression for Taylor’s series this yields 
w 
Mu -v¥[1+2 BS 1s ot 2 “t, 13 
( ) U ; fh, Lu, t U ps F 3 ( ) 
where 
Y= Jen (1, ... o%)texp[—4 - yi tu < o(y,— 9) + x(n%—-9) 6] dy, ...dyy, (14) 


and 4, denotes differentiation with respect to v with v put equal to zero when all differentiations 
have been carried out. On integration 


Y = (1—2u)-—Dexp [4a —2u)-1 (zz ml (15) 





a TT TTT 
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13) 
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it being understood that before this can be interpreted the exponent must be expanded in 
powers of ,. In virtue of the nature of the expression on which ¥ is to operate only a few of 
the terms in this expansion are needed. We find that 


Wo} = (1—2u)-4e-D [a 2 au) (s a omit - (16) 
Wot = (1—2u)-KeD [sa ~2u)-2 Fr ‘ sail | (17) 


whence from (13) 
O 


M(u) = (1 2u)-*#*-9 [: + {2u(1—2u)-! + 3u2(1 — 2u)-*} (E5 ( ~ =) |] ; (18) 


Working, therefore, to order 1/f,, the corresponding cumulant-generating function of 
Dm(Y— 9)? is ; 
K(u) = —4(k—1) log (1 — 2w) + {2u(1 — 2u)-1 + 3u?(1 — 2u)-7} (z5 ( - . (19) 
t fi Lo, 


4. GRADUATION BY A PEARSON TYPE CURVE 


At this stage we may, if we wish, invert (18) term by term to produce a distribution function, 
but at the present moment we are more concerned to see if one of the Pearson curves can be 
used in this connexion. The obvious one to try is that followed by the variance ratio F, since 
the F distribution is the one used in the simpler problem of comparing mean values when 
the ratios of the population g,;’s are known. Let us, therefore, consider the moment- 
generating function of the F distribution. (Again we shall proceed formally, since strictly 
the moments of F after a certain order will become infinite.) 

We have F = (x3/f;) = (x3/f.), where x? and x2 are distributed independently as x? with 
degrees of freedom f, and f, respectively. Now 


Beth = (1-2ulf}th. (20) 
s 2uf,, tf 1 
Therefore, for given x3, we have He"¥ = (1-— fal - (21) 
1A2 


The moment-generating function, M,(w), is then given by averaging (21) over the x3 dis- 
tribution. This does not give anything simple, but at this point it should be noted that, if 
the F distribution is going to be useful, we would expect from large-sample considerations 
that f, would have to be of the same order of magnitude as the f, of the earlier sections, and 
that f, would differ from (k—1) at most by a quantity of the order 1/f,. Let us write down 
M,(u), therefore, correct only to order 1/f,. Doing this we have, after averaging (21), 


M,(u) = (1—2u/f,)-# [2 + ral —2u/f,)2+ Awa = 2ulf | ’ (22) 
2 


Next let us write f, = (k—1)* and let @ = [(k—1)+A/f,] F. Then to order 1/f, we obtain, 
from (22), 
A+2k-1) 


M,(u) = (1—2u)-#*-D [2 +! u(1 —2u)* + ro w(1—2u)*}, (23) 


fh f 


(2-1) 
fi 





- 
fe 


* f, = (k—1)+B/f, was also tried but does not lead to a more general result. It is seen that B must 
equal zero. 


u2(1—2u)-*. (24) 





ie. Kg(u) = —4(k—-—1)log(1—2u)+ (A + 2h—1)u(1—2u)*+ 


22-2 
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Comparing this with the cumulant-generating function of ¥ w,(y,—9,)? given in the last 
t 


section, we see that we can bring the two into agreement if we write 








PH) = 33; (1-gt) ; 
and (4D apt (i-gty’ (25) 
i.e. 7 = ey Bs (1 -st) . 
ira jest) 35'S) 


To order 1/f, it appears, therefore, that Xw(y,—9)? is distributed as {(k—1)+A/f,} 
times F, where f, = (k—1), and f, and A/f, are given by (26). 
5. APPLICATION TO PROBLEM OF TESTING /J, = 4 


The result in the last section may be restated as follows. Suppose we define a criterion 


Sly HFk—1) 


2 mes 
v gp Reine pT (27) 
(k?-1) Tf, Lo, 
and suppose Ff, is the tabled value of the variance ratio F', exceeded with probability P, for 
degrees of freedom f, = (k—1) and 


f= [gay Py(t-za) | 


Then, when 4, = “, we have approximately (i.e. to order 1/f,) 








Pr (v?> Fp) = P. (28) 


It will be seen from (27) that v? involves the unknown quantities w, (= 1/A,0?) as well as the 
sample values w, (= 1/A,s?). The unknown w, also enter into f,. We cannot, therefore, use 
(28), as it stands, to furnish a test which can be applied in practice. However, since the 
,, in each case, enter only into expressions of order 1/f,, the substitution of w, for w, where 
they occur will still leave us with a probability statement (28) which is true to this order. 
Our approximate test procedure will, therefore, be: 


X wy, —9)?/(k—1) 
(i) Calculate : 





= “, (29) 
[+a Fy (se) | 


heey he [an e-BYT" 


(ii) Refer v? to a variance ratio table entered with degrees of freedom f, and fy. 
The procedure can, of course, be put into a form in which the Incomplete Beta Function 
Tables are used if it is felt that reference to a few percentage points is not adequate. 
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t 6. EXAMPLE 


a single characteristic x being measured. As mentioned in §1 above, the y, are now the 
means Z, for the three treatments. The s? are the individual ‘within-treatment’ variances, 

) estimated on degrees of freedom f,, equal to one less than the number of replicates in each 
case. Also A, = 1/n,. Hence w, = n,/s?. 


| The data in Table 1 relate to an experiment in which three treatments are being compared, 
























































| 
\ 
) Table 1 
Estimated 
Treatment No. of Treatment Observed variance of Wy 
) 
(t) individuals mean variance mean (n,/8?) 
| (m,) (%) (s?) (s?/n,) 
} 1 20 27-8 60-1 3-00 0-333 
2 10 24-1 6-3 0-63 1-587 
3 10 22-2 15-4 1-54 0-649 
2-569 
; 
If the true means yp, are all equal, the best estimate of the common yp will be given b 
) bh q £ y 


& = (Lw,%,)/(Xw,) = 24-10. The details required in this calculation and further quantities 
needed later are shown in Table 2. An arbitrary origin, x = 25, is being used. 
































Table 2 
t w,(%,— 25) w,(%, — 25)? w,/Lw, (1—w,/Zw,)? 1/f,(1—w,/Zw,)* 
1 0-932 2-612 0-130 0-757 0-0398 
2 — 1-428 1-285 0-618 0-146 0-0162 | 
3 — 1-817 5-088 0-253 0-558 0-0620 
| } 
— 2-313 8-985 0-1180 | 
) 
) 
We have YL w,(%,— 2)? = 8-985 — (2-313)?/2-569 
t 
‘ = 6-90. (31) 
Substitution into (29) then gives 
‘ 3(6-90) 3-45 





~ [1+4(0-1180)] 1-029 — =~ (32) 


This must be referred to the variance-ratio table entered with f, = 2, and 
Fe = [§(0-1180)] = 8/(0-354) = 22-6. (33) 


The 5% point for F, corresponding to these numbers of degrees of freedom, is 3-43. If, 
therefore, this point is taken as critical, the experiment just fails to demonstrate that the 
true means /, differ. 
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7. EQUIVALENCE WITH JAMES’S RESULT TO ORDER I/f, 


The procedure described in equations (29) and (30) involves the use of F tables; the procedure 
due to James, described in equations (2) and (3) of §1, involves only the use of the x? tables. 
It remains to show that the two procedures are equivalent to order 1/f,. To do this an 
expression is first required for the percentage points of F in terms of those of x?. Such an 
expression can be found in the same way as one finds an expansion for a t-deviate in terms of 
a normal deviate. If F has degrees of freedom f, and f,, then, to order 1/f,, it may be shown 


that 
roo] 14%- he 2), 
>= 
z 
where F> is the value of F exceeded with oe P and x} is the corresponding value of 
x? with f, degrees of freedom. 


Using (34) in conjunction with (29) and (30) my procedure is seen, to order 1/f,, to be 
equivalent to taking as the critical value of v* 


(34) 





oa 3{xp— (k— 5 
= SL ae Fy ('-S0) | i 
or, in other words, to taking as the critical value of ~ w(y,—9)*, the expression 
3 k+1 
Me) = x64 N=) EK! Sie) | 0 


But this is just what is given in equation (3) of §1. Hence my procedure is equivalent to 
James’s to order 1/f,. 

To higher orders, of course, the two procedures differ. There are obvious points which 
could be mentioned in favour of either method, but no extensive numerical work has been 
done to compare them. We may note that, applied to the example in §6 above, equation 
(36) would give for the 5 % point of ~ w,(%,—2)*, 


h(w) = 5-991[1 + +4 {3(5-991) + 4} (0-118)] = 6-96, (37) 
The observed value of ¥ w,(%,— 2%)? is 6-90, which is again just below the 5 % point, so that 
t 


practically the same conclusion would be drawn. 
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THE EFFECT OF STANDARDIZATION ON A x? APPROXIMATION 
IN FACTOR ANALYSIS 


By M. 8. BARTLETT 
University of Manchester 


(WitH AN APPENDIX BY W. LEDERMANN) 


1. Explanation of the problem. In a previous paper (Bartlett, 1950) I proposed a x? analysis 
to test the significance of the successive roots of the determinantal equation 


| R-A| =0, (1) 


where RF is a correlation matrix of p variables; equation (1) is familiar in the particular 
method of factor analysis known as an analysis into principal components. An independent 
derivation of the x? approximation for the significance of the product of all the roots, 


AAg-.- Ap = | RI, (2) 
was given by Box (1949) in this journal, in a systematic study of y? and variance ratio 
approximations to certain likelihood criteria. In my own contribution, however, I was also 
concerned with the successive reduction of the total x?, corresponding to the elimination of 
successive components, analogously to the reduction of the total x? in a canonical regression 
analysis between two groups of variables (Bartlett, 1938). 

The validity of the x? (or alternative) approximation for such reductions is more difficult 
to investigate in detail, as I have previously pointed out, for it depends on the non-null 
sampling distribution. Fortunately, it has still been possible from geometrical considerations 
to justify the approximation under appropriate conditions. Thus in the canonical analysis 
the elimination of real canonical components tends to reduce automatically the dimensions 
(degrees of freedom) of the observations (n-space), the dependent variables (p-space) and the 
independent variables (q-space). The appropriate conditions are that the components 
eliminated should be sufficiently well-determined in direction; this will be achieved even for 
small real effects as n increases, but alternatively, for moderate-sized n, as the real effects 
increase. 

In the principal components case there are at least two variants of the problem of which 
the commoner is unfortunately the more difficult. In the simpler case equation (1) is replaced 
by the unstandardized variance-covariance matrix, the true variances being assumed equal. 
If the true common variance were also given, the problem would be a special case of the 
‘external’ regression problem; but in the purely ‘internal’ tests for significance of the ratios 
of the roots, we may conveniently divide all the roots by their mean, which is identical with 
the mean observed variance. The geometrical argument may be immediately applied to this 
case, but the further effect of standardization, while readily allowed for in the total x?, is much 
less clear-cut when some components are eliminated, as I was obliged to note in my previous 
paper. Following a more detailed examination of the case of one ‘real’ (i.e. non-homogeneous) 
principal component, I have since added an amending note (Bartlett, 1951) summarizing 
the further conclusions reached. It seems desirable, however, to put on record also the 
details of this investigation, so that the precise basis of these further conclusions is available; 
this is done in the present paper. 
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2. x? approximations for the standardized and unstandardized cases. For reference we note 
first the over-all x? approximations. In the standardized case this is 


x? = —{n—}(2p + 5)} log, | RI, (3) 


with 4p(p—1) degrees of freedom. If V denotes the variance-covariance matrix only 
standardized in the more limited sense explained above, we have 


x = —{n—4(2p+1+2/p)}log,| V |, (4) 
with 4(p+2)(p—1) degrees of freedom. The approximation (3) follows from the moment 
formula th 

TP (Jn) T] (B(n—i) +9} 

K| R\} = “es 

P?-*(jn+t) I] T3(n-2) 
i=1 





(5) 


This formula (originally given by Wilks) can be immediately deduced, for example, from the 
sampling distribution of the correlations r;; in the null case, namely, 


| Rie T?-1(4n) 
= p-1 
ype» Tl T'4(n—1)’ 





flr) (8) 


which itself follows from the Wishart distribution of variances and covariances. 

The approximation (4) is most simply arrived at by combining the y? approximation (3) 
with my x? approximation for the homogeneity of the p variances, the latter y? adding p— 1 
further degrees of freedom. More formally, from the known distribution of the latent roots 
¢, of a variance matrix with true variance matrix the unit matrix, namely, 


Pp 
= 


1 PP 
TH? (ppg --- bp"? exp (-5 >> é) IL I (¢i:-—$is,) 


1 i=1 j=i+1 











ie (7) 
2" TT {T4(m—i) TE(p—i)} 
i=0 
and that of ¢,+¢.+...¢,, namely, 
1? 
(1+ P2+--- bp)e" exp (- 2 x +) 
i= . 
aT (pm) od 
p 
we may obtain that of u, = 1, =o; (=1,2,...,p—1), 
j=1 
p Pp 
TP (Uy Uy... Uy PY TT TT (ug — W445) 
i=1 j=i+1 
oti (9) 


p-l 
pinp TT {P4(m —i) Th(p—it)} 


-1 
pl (4pm) TL P((n—i) +8} 
whence E{\V|4 = =! (10) 


p-1 ? 
Pfbp(n+ 2n)} TL PHn—i) 


and the y? approximation (4) follows. 








>~ ir —-” 
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3. The elimination of one heterogeneous component. If the true correlations are not zero but 
all equal (to p, say), then the first true latent root is 1+(p—1)p, and if this is removed the 
others are all homogeneous with the value 1 —p. It is proposed to examine this case in detail 
when 7 becomes large in order to examine the effect of standardization on the reduced 4’. 

We use dV to denote deviations from the true variance-covariance matrix 


V = (1—p) +88’, (11) 


where for brevity the first term omits any symbol for the unit matrix, and & is a column 
vector with components unity (and &’ its transpose). In the completely standardized case, 
the diagonal is standardized to unity, and no fluctuations in variance are possible, but it is 
convenient to handle both cases together at first. We now write, in the equation 


| V+éV -A| =0, (12) 
1—A = p—2, and we have exactly 
604, —z OVy5 eee 1 OV 49 Ovy5 
OV 45 Ovg,—2... | +pX, | 1 dvg.—% ugg «... | = 9, 
i.e. - A, +p2,A; = 9, (13) 


say, where A, is | dV —2|, Aj denotes the modified determinant with its first column replaced 
by 1’s, and 2, denotes summation over permutations of the anomalous column 1. As we 
are assuming that p is not zero, the second term in (13) is of lower order than the first as 
n increases, and we may consider the asymptotically equivalent equation 


where A, , is a determinant like A, with the first variable dropped, Aj , is like Aj with the 
first variable dropped and 2 in place of 1, and XZ, denotes summation over tbe restricted 
permutations of the anomalous column 2 when the first variable has already been dropped. 
The terms of highest order in x in the determinant A, itself are 


(—x)P —(—x)P-? X, Dal (Sv42)? — 004; 6v99] «.-, (15) 
making use of the fact X,dv,, = 0. Hence equation (14) becomes 
Xy{(— x)? — (—a)P 8X, Dal (dv 95)? — dva9 8vs3]} 
—X, Ddv49{( —x)?-2 + ( — x)? 3 Ly dugg} + DD, Ug dv 4250 g3(— 2)? > +... = 0, 
where =, similarly denotes summation excluding the first two variables, or 
pur? + xP~2(22, X, dv49) 
+ xP-8{2>), Dg dv42(dveg — OUgg) — (p — 2) Z, Do(Sv3g — dvqq4053)}+--. = 9, (16) 


from which the values of £, 7, and X, 2,2, Le, where x; are the roots of (16), follow. 
The asymptotic form of the reduced x? (cf. formula (3) of my original paper) is, for large n, 


+ (p—1)P-1A, Ag... Ay 
weg | ES , 
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which, with A; = 1—p+2x;, becomes on expansion in terms of x; up to the second degree, 


n 
(1—p}? 
Substituting from (16), we finally obtain 


{- X U_%,%o+ {- fe =. (17) 





nm {(p—2)(p?-p+t2) a 2) 
(1—p)? P(p—1) 2X, Z_(8042)? + = 


2(p?—3p+4 —2 2 
pe a By By Ug 80425095 — 1! 2X, 2.4011 bVy9 + p XX, 236042 bv . (18a) 


E, D_E,D, 40426054 


In the standardized case, when dv;; = 0, this reduces to 


(p—2 +2 4(p—2 
aap 2 Ae - LS 1 2(97y2)? + a hE, Yq Uy Uy Ory. 4734 





2(p2-3p+4 
— ts, BB adridrn) (186) 


where, since now 1, is a proper correlation, it is written 7». 


4. The asymptotic expected value of x*. To obtain the expected value of expressions like 
(18a) or (186), we require various large-sample fluctuation formulae. The asymptotic dis- 
tribution of the v,; or r;; being normal for large n, these may be obtained by standard methods. 
Two examples will be sufficient, one from each set. 


: dw dw 
(i) COV (712; 713) ~ PPL {(“S— $004, — j5v) Te $0, — jbu5)| , 
where w,, denotes the completely unrestricted sample covariance (to distinguish it from 
V,, above), or since all true covariances are equal to p, this reduces to 
COV (Wyg, M13) — P COV (Wy9, W1,) — P COV (Wyg, Wag) + $ COV (Wy, Wag) + $ Var (W,;). 


The individual terms may be evaluated by the standard combinatorial rules, giving on 
reduction, under the above conditions, 


M COV (795713) ~ $P(1 — p)* (2 + 3p). (19) 

Similarly, in addition to the well-known formula 
n var (174) ~ (1 —p?)?, (20) 
we obtain N COV (719, 34) ~ 207(1 —p)?. (21) 


FY 1 wy Ll, 
(ii) COV (V9, ¥4,;) ~ pL ie uae (81, + bWo.4+. Buy) [a= dw, some +...+ dy») || 


= eee lee w + oov (w Ws3) 
p 12> “11 p 12> 33/> 


the remaining terms having vanishing coefficients, 


~ -Fa- -p). (22) 








eo 


= ss ke 2 ee hl 
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Similarly N Var (Vy.) ~ (1—p) (1 + p+ 2p? — 2p8 — “© (3 -p)}, (23) 
n0ov (ties %s)~P(1 =p) {1+ 2p— 268 72 (3p), - (24) 
nov (tin:%x)~ 26% (1-p){1—p—= (3p), (26) 
n0ov (01 5e)~ —= (1p), (26) 
COV (043, Ugg) ~ -2a —p). (27) 


Substituting these formulae in the average values of (18a) and (186) we obtain after some 
algebra the expected values, in these two cases respectively, 


4(p—2)(p+1) (28a) 

1 2 

and H(p—2)(p+1)-'P—NP=2) py. (286) 

The first result merely checks with the result following from the more powerful and wider 

geometrical argument, but the second is of considerable interest. It indicates that the 

reduction in the total number of degrees of freedom, due to standardization, is only partially 

felt in the smaller roots, and disappears completely when the components eliminated take 

up the dominant part of the total variance. The degrees of freedom which I provisionally 

suggested for the reduced y? in the standardized case are thus in general too low, and it is 
safer to allocate the number appropriate to the unstandardized case, namely, 


Hp—k-1)(p—k+2), 
when k components have been eliminated. 

It is also evident from the expected value (285) that the x? approximation cannot strictly be 
valid in the standardized case except in the further limit p— 1, corresponding to the residual 
components contributing negligibly to the total variance; this is in contrast with (28a), 
which, as anticipated, holds (for large n) for all p + 0. In this limiting case we might reasonably 
expect the reduced x? for the standardized and unstandardized cases to coincide, but it is 
of interest further to demonstrate that the quadratic expression (185) is then of the correct 
x? form. 


5. The limiting quadratic form in the standardized case. Let us denote the column vector 
of correlations ér,;(i,j = 1...p; 7>%) by &, and the corresponding variance matrix, with 
4p(p—1) rows and columns, by W(p). The quadratic form in (186) may be denoted by 
§’A(p) 8. Transforming to new standardized and uncorrelated variables 


n= W-(p)8, (29) 
we have 5’A(p)& = n W*(p) A(p) W*(p) 9. (30) 
As p> 1, we require to prove that 
B= oe W*(p) A(p) W4(p) 


is a matrix of rank }(p—2)(p+1) with latent roots 1 and 0. Since the factor n/(1—p)? in 
(186) cancels with the factor (1 — )?/n in (19), (20) and (21), we can drop this factor, and then 








342 Effect of standardization on a x? approximation 


put p = 1 throughout. We then find (using the notation W and A for these modified limiting 





matrices) that W = 34 58e’_1P, (31) 
p—-l p*—3p+4__, 1 
PP dont. prs ae, 32) 
Pp p*(p—1) eS Pp 


where & denotes the ‘unit’ vector, and P denotes the symmetric matrix which has zero 
elements p,, except when the suffices r and s, which are of the form r= (i,j), s=(k,1), are such 
that i,j,k and / are all different; in such cases p,, is unity. For example, for p = 5, P is 


o 


o 6 6 8..0o © 
ocol KE eS Oo SO 


0 








1 
] 


mm OoOOCcrwroeo9gccoe © 
ow or Or Ooo & 
coor orr,r OO CO SO 
ome Oo OC CO.Oo OF 
oor Oo CO SO OF - © 
cocooUcOmlUlrRwCUCOCUCUCcCOCUlr FP OOo lh F€ 
cocScocmUcOUlhwrKwLCcCOUCUcOlhwS Oo CUrF 
[S-s.Qaqo ~- G:C. = 


when the order for 7 is taken as 12, 13, 14, 15, 23, 24, 25, 34, 35, 45. 
Itis evident that € isa latent vector of P with latent root 4( p — 2) (p — 3); also&’E = $p(p—1). 
Suppose in consequence we assume that 
(p—2)(p—3) pe, ys , 
Po SE (33 
p(p .; 1) +33 See ) 
where v, are the remaining latent roots, and &, the corresponding latent vectors (with 
&,5, = 1). In view of the structure of A from (32), and the resolution of the unit matrix, 


2 , , 
pip) +ZE,&,, 
we find Aare Mle e 
tiie 
and further, Wawt = WA= (5-2) (F= +2) EE, (34) 


Since $p(p—1)—4(p—2)(p+1) = 1, we require all the latent roots of the matrix in (34) 
to be unity, except for the zero root corresponding to §&’, which has dropped out. This 
requires that the values of all the remaining values pv, satisfy the equation 


aint) 


or (x+p—3)(x-—1)=0 


or z=+1 or —(p-3). (35) 


ee 


i ~~ 











TH 


.") 
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Since the sum of all the latent roots of P is 0, we find that its latent roots should be 


4(p—2)(p—3)... once, 
—(p—8)... multiplicity p—1, 
1... multiplicity $p(p — 3). 
This requirement represents a rather interesting property of the matrix P. I easily verified 


it for p = 3 and 4, but in the general case I am indebted to Dr Ledermann for a proof, which 
he has kindly summarized in the following appendix. 


APPENDIX 


By W. LEDERMANN 


In §5 of the preceding paper a matrix P = (p,,) was introduced whose (3) =4p(p—1) rows and (*) 


columns are specified by pairs of integers r = (i,j), = (k,l), where 7,7,k,l run through the integers 
1, 2,...,p, subject to i<j and k<l. The elements of P are defined as follows: 


1, if 1,7, k,l are distinct, 
*™ o in all other cases. 
We shall prove that the matrix P has a simple spectral resolution and that, if p>3, its characteristic 
maren |A-P| = A-Hp-2)(p-3)} A+p—3)?4A— Hine», a) 


The first assertion, that is, the existence of a pure diagonal canonical form for P, follows at once from the 
fact that P is a real symmetric matrix. 
Next, if it can be shown that P satisfies a polynomial equation 


f(P) = 0, (2) 
then each latent root of P satisfies the scalar equation 
f(z) = 0. (3) 


This follows from the fact that the canonical form of P also satisfies the matrix equation (2). It will 
presently be proved that P satisfies a cubic equation, whence we may infer that the three roots of this 
cubic are the latent roots of P with certain multiplicities. 


From the definition of p,, it is seen that each row and column contains exactly (75°) units and 


—2 
\ ) = (? ° ) zeros. Since P is symmetric, the elements of P? are the scalar products of one row with 


another. Let the symbol (i,j)’ (u,v) denote the scalar product of the rows specified by the pairs (7,7) 
and (u,v) respectively. By counting the number of times that a unit occupies corresponding places in 
two given rows we find that 


ini —3 -—4 
Gi (@v)= Cod ve (75°) 7 (7; ); 


according as the pairs (i,j) and (u,v) have two, one or no numbers in common. Corresponding to the 
three possible values of the elements of P* we write 


Pi = (?5°)r+(?y )e+ ("2 ‘) 2 


where J is the unit matrix of order (3) and P, (P,) isa matrix with a unit element at the intersection of 


the row (i,j) and column (u,v) provided that these pairs of indices have exactly one (no) number in 
common, all other elements being zero. 
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Introducing the matrix (denoted by &&’ in §5 of the preceding paper) 


1 1 1 
Rees 5 te l 
| ee 1 
we have P,=E-P-I, P,=P. 
_2 ac a 
Hence p?=(?—?) 14(?=3) (2@_-p_n+(?-*)P 
2 2 2 
= (p-3)1-(p-4P+(? 5"). (4) 
On multiplying by P and using the relation 
mp =PE=(?,*)z, 
—3 -—2 
we get P= (p-3)P-(p—4) P+ (7 a, )z. (5) 


whence, on eliminating E between (4) and (5), 


p-{(?5*)-r+a}pe-{(?>*) (w-9+p-3] P+(p—3) (?;*)r=0. 


This is the cubic equation for P referred to above. Its roots are easily found to be 

5. —pt+3, 1. (6) 
It follows that |A-P|= {a - (” ia Va +p— 3} {A— 1}, 
where a,b,c are certain non-negative integers such that 


atb+e= (2). (7) 


Also, since the trace of P is equal to the sum of the latent roots, 


a(?5*)+0(-p+a)+e=0. (8) 


Finally, we use the fact that the latent roots of P? are the squares of the quantities (6) with multiplicities 


a,b,c. Hence p-2\ a\./e-2 
a( ; )'+0(-p4ay+e= (2) ( 2 ). (9) 


where the right-hand side is the trace of P?. The numbers a, b,c can be determined from (7), (8) and (9), 
it is fi that 

and it is found tha on i, Ou 1: kde oe. 

This establishes the equation (1). 
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THE JACOBIANS OF CERTAIN MATRIX TRANSFORMATIONS 
USEFUL IN MULTIVARIATE ANALYSIS 


BASED ON LECTURES OF P. L. HSU AT THE UNIVERSITY OF 
NORTH CAROLINA, 1947 


By WALTER L. DEEMER anp INGRAM OLKIN, University of North Carolina 


Editorial Note. The following paper was submitted by Prof. Hotelling in the summer of 1950 for 
publication in Biometrika with the accompanying Note from Col. Deemer and Mr Olkin: 

‘In 1947 Prof. P. L. Hsu gave courses in multivariate analysis at the University of North Carolina 

in which he developed new techniques for finding Jacobians of certain matrix transformations. Hsu 


returned to China at the end of that academic year, leaving as a record of this material only the notes 
of students in his classes. 


‘We (Deemer and Olkin) became interested in these matrix transformations in the course of our work 


in multivariate analysis. Since we did not take Hsu’s courses, we used Ralph Bradley’s notes as a basis 
for our studies. 

‘In the spring of 1948 there was a seminar in multivariate analysis under the direction of Prof. 
Harold Hotelling. At his request we prepared some lectures on matrix transformations. At the com- 
pletion of the seminar, Prof. Hotelling and Prof. R. C. Bose suggested that in view of the importance 
of these: techniques and their non-availability, we should prepare an expository paper giving a 
systematic development with all proofs given in detail. 

‘ All the new ideas of importance in this paper are due to Hsu. Our contribution has been to organize 


the material in logical form, making all proofs complete with the necessary lemmas explicitly stated 
and proved. 


‘Efforts were made to communicate with Prof. Hsu in order that he could review this material before 
it was circulated. To date such efforts have failed.’ 


Since this contribution was received contact has been made with Prof. Hsu and the paper is now 
published with his approval. A suggestion which he made for improvement has been added as a Note 
on p. 361. However, in view of the liberty they have taken in the exposition of Hsu’s ideas, the American 
authors would like it to be clear that they are to be held responsible for any errors. 


FOREWORD 


By Harotp Horetiime 


We are apparently at the beginning of a major development in the use of statistical pro- 
cedures for joint treatment of a multiplicity of correlated variates. Many of the new methods 
depend ultimately on the distribution of the roots of certain determinantal equations. This 
distribution was published simultaneously in 1939 by P. L. Hsu (1939) and R. A. Fisher 
(1939) in the Annals of Eugenics, and, excepting for a constant multiplier, by 8. N. Roy 
(1938-40) in Sankhyd. Hsu’s derivation, which, like Roy’s, is otherwise complete, demon- 
strates the correctness of his formula for this constant multiplier only for the case of three 
variates. Proof that the formula is correct for the general case has turned out to be 
unexpectedly difficult to develop. In seeking to close this gap while lecturing on advanced 
multivariate analysis at Chapel Hill, Hsu developed what appears to be an extraordinarily 
powerful method of dealing with a wide class of transformations of the elements of one matrix 
into those of another induced by a functional relation between the matrices. This technique 
not only leads to a solution of the original problem but appears to offer the possibility of a 
fresh attack on other problems of multivariate distribution theory «=: well. Heretofore it has 
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been recorded only in lecture notes made by students in the spring of 1947. Col. Deemer and 
Mr Olkin have not merely reproduced lecture notes but have reworked the material and 
improved it in certain essential respects. 


1. SUMMARY AND INTRODUCTION 


In finding the distribution of multivariate statistics it is often desirable to make certain 
transformations. However, the calculation of the Jacobian is frequently rather difficult. 
We have therefore assembled in this paper the Jacobians of many of the most used trans- 
formations. 

Certain general methods are applied in the calculation of the Jacobians; these methods will 
be found useful in the calculation of other Jacobians not given here. Three techniques of 
particular value for non-linear transformations are: 

(a) Taking differentials to get a linear equation in the differentials, e.g. Theorem 4-1. 

(6) Transformations to new variables so that the Jacobian can be evaluated as the product 
of other Jacobians which are more easily calculated, e.g. Theorem 4-2. 

(c) Introduction of new matrices such that the Jacobian of the original transformation is 
the discriminant of the bilinear form in the variables of the new matrices, e.g. Lemma 4-3 
and Theorem 4:3. 


2. NoTaTIon 


Because the shape and symmetry of matrices are important in finding Jacobians, we shall 
consistently use the following notation: 


(2-1) Lower-case letters will denote scalars. 
(2:2) As is customary, a prime will denote the transpose of a matrix. 


(2-3) Lower-case letters with a superior bar will denote column vectors, e.g. %’ = (u,, ..., U,). 


(2:4) Capital letters will denote matrices. A system of superior symbols will be used as 
follows: 
(a) A is a symmetric matrix of elements a;;. 
(6) A is a skew-symmetric matrix of elements a,; above the main diagonal and 
ay, = —4a,; below the main diagonal and zeros in the main diagonal. 
(c) Aisa triangular matrix with zeros above the main diagonal, e.g. 


Ay bh 
A2 A2 
Np rp App 
(d) A matrix not in classes (a), (b), (c) will be denoted by a capital letter with no 


superior symbol. 
(e) (dX) is the matrix of the same shape as X, whose elements are the 
differentials of the elements of X. This will be denoted by X*. 
(2-5) Diagonal matrices will be denoted by D with a subscript showing the elements, 
e.g. D, is the diagonal matrix with elements 1,72, ..., 7. 
(2-6) Orthogonal matrices will be denoted by A and I’. 


(2:7) A:pxq means the matrix A has p rows and g columns; the same notation will be used 
for matrices with superior symbols. 








nd 
nd 


‘in 
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(2-8) Since it is the absolute value of the Jacobian that is wanted, it is to be understood that 
all determinants are evaluated up to the sign only. 
(2-9) Let X, Y be two matrices which satisfy an equation Y = f(X). Let X and Y have 
the same number n of independent elements, and denote them by 2j,...,2, and 
Yj; +++ Yn, Tespectively. Then the Jacobian, Cites , will be denoted by D(X ; Y). 
b Re hos} 
For purposes of calculation it is sometimes desirable to express this in terms of the 
elements of X and Y, for example, D(X; Y) might be written D(x,,, x45; yi, Y4)- 
More generally, if X,,...,X,, ¥j,...,Y,, satisfy the equations Y, = f,(2,, ...,x;) 
(¢=1,...,m) and (X,,...,X,) and (Y,,...,¥,,) have the independent elements 
Ly, ...,%y and Yj, ...,Y,, respectively, then D(X,, ...,X;; Yj, ...,¥,,) denotes 
O(a» +++» Yn) 
Abas --29Bq) 
(2-10) The following abbreviations will be used: p.d. for positive definite; n.s. for non- 
singular; J for Jacobian; tr for trace; [, for the identity matrix of order p. 


3. THE JACOBIANS OF LINEAR TRANSFORMATIONS 

We state here a number of theorems on the J of linear transformations, some without proof. 
When the J does not immediately follow from previous theorems we will give the proof. 

THEOREM 3:la. The J of the transformation Y = aX (Y, X: px q), where a is a constant 
scalar +0, is D(X; Y) = a9. 

Proof. y,; = ax,; and dy;,;/dx,; = a. Therefore the scheme of coefficients is diagonal with 
pq a’sin the main diagonal. Q.£.D. 

THEOREM 3-16. The J of the transformation Y = aX.(Y,X: px p), where a is a constant 
scalar +0, is D(X; Y) = aiv+, 

THEOREM 3-2. The J of the linear transformation 7 = AZ (J,%:px1; A: pxp) is 
DE) =|4|. | 

Proof. y; = X a%,%, and dy,/0x; = a,;, therefore J = D(z; 7) =|A|. Q.B.D. 

k=1 


CoroLttaRy 3:2. The J of the transformation %= Ay (%,9:px1; A:pxp) is 
Dy; %) =| A|-. 
This follows from Theorem 3-2, since | A-!| = | A |-1. 
THEOREM 3-3. The J of the transformation 
y =2'A'(j,Z:pxl; A:pxp) is D(%;¥)=|A|. 
THEOREM 3:4. The J of the transformation 
Y = AX(Y,X:pxq; A:pxp) is D(X; Y)=|A|¢. 
Proof. This follows from Theorem 3-2 and 5 B. 4+, since the transformation of each column 


of Y is independent of the others and there are g such columns of Y, the J of each column 
transformation being | A |. 


TuroreM 3-5. The J of Y = XA (Y,X:pxq; A:qxq)is D(X; Y) =|A|?. 
Proof. Write Y’ = A’X’ and the result follows directly from Theorem 3-4. 


+ References of this form are to the appendix to the present paper. 
Biometrika 38 23 
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THEOREM 3-6. The J of the transformation 
Y = AXB(Y,X:pxq; A:pxp; B:qxq) is D(X; Y)=|A|*|B|?. 
Proof. Let Z = AX and then Y = ZB. D(X; Z) =|A|¢ and D(Z; Y) =|B|?. Using 
5 B. 2 the theorem follows. 


Lemma 3-7. Let Y = A,A,_,...4,XAj... 45,4), 
then D(X; ¥) = D(X; ¥,) D(%; Ye)... D&q1; Y), 
where Y,= A,Y,,4; (¢=1,...,%); Y=X, Y,=Y. 


Proof. Follows directly from 5 B. 2. 
THEOREM 3-7. The J of the transformation 
Y = BXB(Y,B,X:pxp) is D(X; Y)=|Bl?4. 
Proof. By 5 A.5 we can write B = F,, F,,_,....,, where some of the F’s are of the form 
5 A. 1 and the rest are of the form 5 A. 2. Then 


(3-7-1) Y = F,,... F,XF;... Fm 

By Lemma 3-7: D(X; Y) = D(X; Y,) D(Y,; ¥,)... D(Yn_1; Ym)s 
where Y=uFY,.F (v=l,....m); Y 2X, Yiwe!. 
We show that D(Y,_,; Y,)=|F|?! (v=1,...,m), 


and hence it will follow that 
D(X; ¥) = | Fy |?*... | F[P4? = | Bq PH = | BPH 

(1) Let @ be any of the F’s which are of the form 5 A. 1; the transformation Y, = GY,_,@’ 
implies Yur = Vy; Yig = AX, (t+); Yin = jy, (9,4 +4). 
The determinant of the partial derivatives is therefore bet F with p— 1 elements a, one 
element a?, and the remaining elements 1. Therefore D(Y,_,; Y,) = a?+1 = |G on 

(2) Let H be any of the F’s which are of the form 5 A. 2; the transformation Y, = HY,_, H’ 
implies Yui = Ty + 2axy+a*,,, 
Yui = Vik = Ty, t+ary, (k+%), 
Yin = Xz (J, kt). 
The determinant of partial derivatives will have 1’s in the main diagonal and 0’s below the 
main diagonal. Hence DY,_4; ¥,)=1=|A|?#. ono. 
CoROLLARY 3-7. The J of the transformation 

Y = A1XA’ (Y,X,A:pxp) is D(X; Y)=|A|-°». 

THEOREM 3-8. The J of the transformation Y = AX (Y¥, A, X: p~ p) is D(X; Y) = Tat. 


j 
Proof. Yig = XL 4;%y (V<)J). 


The elements on’'one side of the main diagonal of the determinant of partial derivatives are 
zero. The main diagonal contains a,, from dy,,/02,,; it contains a. twice: once from dy;5/02 9, 
and once from OY 29/0X2; it contains a,, i times: once from each of dy;,/Ox,;, (j = 1, ..., 1). 
Therefore the determinant is ll ai;. Q.E.D. 


i=1 








nn Oh lCU'...lClUCUlC 
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THEOREM 3-9. The J of the transformation 


Y = 4X'+ 84 (Y,4,X:pxp) is D(X; Y) = 2° ff ag-*, 


i=1 


Proof. Pre- and post-multiply by 4-1 and A’- respectively: 


(3-91) AAYA' = Xf’ 4 J-%. 

Let Z = 4-1YA’— and 0 = 4-1X. Then 

(3-9-2) . Z=0'+0, 

(3-9-3) D(X; Y) = D(X; 0) D(O; Z) D(Z; Y) 
by 5 B. 2; 

(3-9-4) D(X; 0) = Tag 

by Theorem 3-8; Tr 

(3-9-5) D(z; Y) =|A |p = [hag 

by Corollary 3-7 and 5 B.1; se 

(3-9-6) 2 = 2u, and 2%; = Uj, - 


from 3-9-2. The determinant of partial derivatives is therefore diagonal. It contains p 
elements equal to 2 and the remainder 1’s. Therefore 


(3-9-7) D(O; Z) = 2°. 
Substituting in (3-9-3) D(X; Y) = 2” Ty az-4, Q.E.D. 
i=1 


THEOREM 3-10. The J of the transformation 
Y = GX@’ (Y, X,@: px p) is D(X: Y) =|G|?-1. 
Proof. The proof follows along the same lines as the proof of Theorem 3-7 with the necessary 
changes due to X and Y being skew symmetric. 


4. NON-LINEAR TRANSFORMATIONS 


The general method for finding the J of a non-linear transformation is to take the differential 
of both sides and then to find the J of the resulting linear transformation in the differentials 
(see 5 B. 3). 


THEOREM 4-1. The J of the transformation V = 11" is D(T; V) = 2° TLR. 
me i=1 
Proof. Taking differentials of 7 = 1'7", : 


(4-1-1) dV = TdT) + (dT) 7". 
Let V* = dV; 7'* = dT, then 
(4-1-2) V* = TT *' 4+ TT", 


This is the transformation of Theorem 3-9, and hence the theorem follows. Q.E.D. 

Lemma 4-2. Let A, B be p.d., px p matrices. Let | A—¢B| = 0 have p distinct roots 
$1, > $2>--. >, > 0, and let D, be the diagonal matrix with elements ¢,, ...,¢,. Then there 
exists a unique matrix W = (w,;): px p, where w,;>0 (¢ = 1,...,) such that A= WD,W’ 
and B= WW’. 


23-2 
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THEOREM 4-2.+ The J of the transformation A = WD,W’; B = WW’ (all matrices p x p) 
is ote DP 
D(W, 9; A, B) = | W |P+# 2” Th (9:— $5). 
1< 
Proof. Taking differentials: 


(4-21) dA = (dW) D,W' + W(dD,) W'+WD,(dW)’, 
(4-2-2) dB = W(dW)'+(dW) W’. 

Let A*=dA; B*=dB; W*=dW; D,=4dD,, 
then 

(4-2-3) A* = W*D,W'+WD,W'+WD,W*’, 
(4-2-4) B* = WW*'+ W*'W’. 


By 5 B.3: J = D(W*,7; A*, B*). Pre- and post-multiply (4-2-3) and (4-2-4) by W-! and 
W’- respectively: 


(4-2-5) W-4*W' = WW*D,+D,+D,W* WwW, 
(4-2-6) Wo B*w’ = W* W'4 WoW. 

Let 

(4-2-7) U = W-1A* WwW’, 

and hence A* = WUW’'; 

(4-2-8) V = WB Ww’, 

and hence Be=WVw'; 

(4-2-9) R= W-W*. 

Substituting in (4-2-5) and (4-2-6): 

(4-2-10) O = RD,+D,R'+D,, 

(4-211) V=R+R’; 

(4-2-12) D(W*,»; A*, B*) = D(W*,; R,y) D(R,n; 0, V) DO, V; A*, B*) 
using 5 B. 2; 

(4-2-13) D(W*,9; R,) =|W\| 

from (4-2-9) using Theorem 3-4 and 5 B. 4; 

(4-2-14) D(O, V; A*, B*) = D/O; A*) D(V; B*) 
by 5 B. 4; 

(4-2-15) D(0; A*) = D(V; B*) =| WP 

by Theorem 3-7. Hence 

(4-2-16) D(U,V; A*, B*) = | W |%+, 


To find D(R,7; 0, V) we use (4-2-10) and (4-2-11): 
Ug = WGitM, ly = Wy, 
Ug =TyPyt TP, Vy =Tytty (t+)). 


+ For an application see Hsu (1939). 
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The scheme of partial derivatives will then be as follows: 








N: "i Vi 145. 

(@=1,....p) (@=1,...,9) (¢>J) (<j) 
u,, (¢ = 1,...,p) I L 0 0 
V4, (¢ = 1,...,p) 0 2I 0 0 
uj; (i<j) 0 0 D C 
vj; (i<j) 0 0 I I 








where C is the diagonal matrix with elements $y, ds, ...,6,; $3; ---s Pps --+3 Ppa Pp; Pp» and 
D is the diagonal matrix with elements ¢,, ...,¢, (p—1 terms), ,,...,¢, (p—2 terms), ..., 
Pp-1: 


The matrix L does not affect the value of the determinant, which is equal to 


Pp 
2” | D-C| = 2” 1 (4,—$}) 
ws 
by applying 5 A.7 twice. Hence 
Pp 
m = 2P | W |p+2 Il (d;—9,)- Q.E.D. 
i<j 


Lemma 4:3. Let X, = (x$?), (p, x q,) and X, = (x!), (p, x q,) be two matrices containing 
a total of N independent variables (N <p,q,+29,) and Y, = (y$P), (p:x%), Yo = (y{?), 
(2 x g,) be two other matrices containing the same number of variables. 

Let =: N x 1 be the vector formed from all N x’s by using the elements in the first column 
of X, as the first p, elements of =, the p, elements in the second column of X, as the next 
p, elements of =, ete., through the columns of X,. 

Let H: N x 1 be the vector formed from the columns of Y, and Y, in the same way that = 
was formed from X, and X,. Thus 


= 1) (1 1) ld) 2 2 
SB! = (2 of)... 2 2 of... 2,), 


_ (yA1) 4 (1 1) 42) 4f2 2 
H’ = ( yf... Oa. Yt @... Pa.) 


Let = = AH (A: Nx), where A is a matrix of constants, be the connecting equations 
between the variables x{} and y{ (v = 1, 2). 

Let R, = (r$?), (gq, x p,) and R, = (7$}), (q x p,) be two matrices with a total of N inde- 
pendent variables. Then D(Y,, Y,; X,, X.) is equal to the discriminant of tr (R, X, + R,X,), 
where tr (R, X,+ R,X,) is to be considered as a bilinear form in the variables of R,, R, and 
Y,, Xs. 

Proof. Let u(N x 1) be the vector formed from the elements R, and R, by taking the rows 
of R, in turn and then the rows of R, in turn: 

fe! = (PD. Ta AB «+» Mya,» 
2 qq Py am z 
tr(R,X,+R,X.)= DY LY all =wE =p’AH. 
v=li=le=1 
The discriminant of this bilinear form is | A |. 

The J, D(Y,, ¥,; X,, X.) of the original transformation = = AH is | A| (by Theorem 3-2). 

Q.E.D. 
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Introduction to Theorem 4-3 


Let V = Ny ” be p.d. (Viz: px p; Vig: pq; Vog: 9x q) (p<q). There exists a n.s. 


matrix G of the form - a (G,: px p; Gy: q xq) such that 


.'s ¢ 
(4-3-2) GVG@ = |D, J 
0 t 


where D,: p x p has elements 1, ...,7,; 7j-.. 73, are the roots of the determinantal equation 


I, D, 0 
(4:3-3) V=G@4 [2 F Jo 

0 qa 
from (4-3-2). Let = P 0 

ar ae 
then 

a a 

= Fr Gil? ® tis 

wos Ee all& 3, |[o el 


This is a transformation from the v’s to the p’s, g’s and r’s. V contains }(p+q) (p+q+1) 
variables, and the right side of (4-3-4) contains p?+q?+~ variables. Let d = q—p, then 
[(p?+94?+p)-—4(p+ 9) (p+ ¢+4+1)] = 46(6-1), and the number of variables on each side 
will be equal if p = g or p = q-1. 

We now partition Q and rewrite (4-3-4): 


a Pr « 0 J, D, 0 P’ 0O 0 
(4:3°5) V= 2 Qu en [3 } J | 0 Qi e, ’ 
0 Qe Qee 0 0 I; 0 Qie Qe 


(Qu: PX DP; Qe: p x8; Qa: xd). 
Post-multiplying Q,, and Q,, by an orthogonal matrix A will not change the value of this 


product, i.e. P 0 0 I, D, 07) [P’ 0 0 
V= [> Qu an D, i. J f Qin Qo1 | 
0 Q1 G24 L9 9 FILO AQ. A’Q. 


as may be verified by multiplying out this expression. By 5 A. 10 we can choose this A so 
that Q,,.A is triangular with 0’s below the main diagonal. When we do this we reduce the 
number of variables on the right by 46(é— 1) and thus have the same number on each side. 
The transformation is unique if r,>...>r,>0; p,;>0(j=1,..., p). We now find its J after 
the number of variables is equalized. 

Hsu has used this transformation in finding the distribution of the canonical correlation 
coefficient (see Hotelling, 1936) when the population values are not zero. Hsu’s work on 
this has apparently not been published. 


THEOREM 4:3. Let Q= “9 | 
21 


(Qi: PX DP; Qi: pxd; T:8xd) (p<g), = q—-—p; ty>O (i =1,...,8). 


we ————— 








1.8. 


ion 


-1) 
en 
ide 


his 
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j & 6 
Let ® = D, . 
0 fy 
(D,: p x p). Then the J of the transformation 
i = ie / 
(4-3-6) V = I oll 
=_— é . 
is D(P,Q,r; V) = 27+4| Pla | Q (11 é)( I r) (ii (t-r). 
i=1 i=1 i<j 


When g = p or g = p+1, the terms in ¢ do not appear. 
Proof. Take differentials of (4-3-6) and let V* = dV, P* = dP, Q% = dQ,,, Q& = dQyp, 
Qi = dQ,,, T* = dT, D, = dD,, and let 


QM Qh 
* a ° 
B= los to}? 
then 
0 D, 0 
a r al ?* wr tery ” ‘0 
-2. Vt = 
ay ‘0 ello al*Lo allo oe] *Lo [o, : l(t Q 
0 0 0 
ua 1 
J = D(P*, Q*,7; V*) by 5 B.3. Pre-multiply (43-7) by i eal and post-multiply by 


ee Me 4 : , 
Q- and introduce new variables: 


0 

ot fF" Clair oF 
ao Te a 
(4-3-9) A = P1P*, 


ie [an] - © Los] 


To simplify the notation, we write 


(4:3-11) Fa =@Q" 7 |. 


but we do not consider B,, and B,, as new variables, since Q-!7'*’ is not triangular, and hence 
the number of variables in B,, would be greater than the number of variables in T*’. We 
then get from (4-3-7): 








= A’ 0 0 0 D, 0 
(4-3-12) X=|0 By, By |o-a] Bi m+ [> 0 J 
‘ais Se 0 B BB) le oo 
{ A+A’ AD,+D,By,+D, D,By 
= |(AD,+D,By, + D,)’ By t+ By By.+ By 
; (D, By)’ (By, + By,)’ Boz + Bye 
By 5B.2 


(A) J=D(P*,Q*,7; V*) 
D(P*, Q*, 4; A, By, By, Qh; P*, 9) D(A, By, By, Qi, 7+, q; X) D(X; V*) 
= D,D,Ds. 
By 5 B.4 * D, = D(P*; A) DQ, Qh; By, Bay) 
= | P{*j9|" 
(from (4-3-9), (4-3-10), using Theorem 3-4). From (4-3-8), using Theorem 3-7, 
(B) D, ak D(X; V*) — | P |pta+t | Q |Ptat, 
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To compute D, we partition X and hence can equate the submatrices of X to the submatrices 
on the right of (43-12): 


a Xu Xin Xiz = = a 
X= 12 Xx Xog| (Xi, Xig,Xo9: 9x YP; X13, Xog: p x8; Xgg: Fx 8). 
Xi3 Xt, Xzy 


We then have 


(4:3-13) X,,=A+A'’; X= AD, + D, By, + D,; Xoo = By, + Bi; 

(4:3-14) X13 = D, By; Xeg = Byz+By; Xaq = Bog t+ Bry. 

The transformations in (4-3-13) and (4-3-14) are independent, so we can write 

(C) D, = D(A, By, 9; X43, X12, X22) D(Bay, Qo, T*; X13, X23, Xs) 
=> D,D;. 


Computation of D,. For convenience we will relabel the elements of X: 
(43-15) Xy= (23); Xa2 = (Yay); Xv = (245); 
alli,7 = 1,...,, and we denote the elements of the other matrices as follows: 
A=(aj); By =(by) (47 =1,...,2); 9 = (91 -+» Np): 
Using (4-3-13), (4°3-14) and (4-3-15) we get 
(4-3-16) Meg = ys Ye = MAH byt 2% = Wy, 
(4:3:17) a =aAyt+ay (t<J); Yig = 154+ by (t+); iy = by +b, (t<)J). 
The transformations in (4-3-16) and (4-3-17) are independent, hence 
Dy = D(A, By, 9; X43, Xp, X99) = Di, bis Mas iss Yair Xia) (Gigs bigs Legs Yass 2s) 
For D(a, b¢: 0:3 Ties Yaa» Zz) WE have the scheme of coefficients: 
Ag by 1 
Z| 2 0 0 


Cu 0 2I 0 = 227, 
Yii | LMI 
the values L and M not affecting the determinant. To compute D(a,;,b;;; X;;, Yi, 2) We 


rewrite in (43-17): Rey 
( Yig = 157A tb; (i<j =1,...,p), 


Yxi = Ay, +1; b5, (a <j => EA, 
We then have the scheme of coefficients (i <j in all cases) 
Oy Dyg Oye Duy 


ty |I 0:7 0 
z,\/O0 1:0. 


D0 0 
: rw wwe | oi-le o|| = |D||D-CD-=0| = | D*-0*|, 
jt 
¥y|C 0:0 D 








using 5 A.6 and 5 A.7 twice. C is a diagonal matrix with elements 


Tay 1 gy 20091 ps Tay eee T ps +63 Tp Tp Mp3 

















6s 
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Dis a diagonal matrix with elements 


11, ++. (p—1 terms), rg, ...,72( — 2 terms), ...,%p_4- 


Pp 


Therefore D?—C? = [I (3-79), 
i<j 
and hence 
> =< P 
(D) D, = D(A, By, ,9; X41, X12, X09) = 27” IL (3-79). 


Computation of D(By,Q%,7*; X43, X23, X33) = D;. The connecting equations are given 
in (4-3-14). B,, is independent of the other variables, but recall that the original variables 
are Qi, and Q% and that the relation (4-3-11) does not define B,, and B,, as new variables. 

The scheme of coefficients, from (4°3-14), is 





By Qi T* 
X,|Fji0 0 
(4:3-18) oe ae |F||@|, 


so the value of L does not affect the value of the determinant. | F'| is the J of the trans- 
p 8 
formation X,, = D,B3, (X43, Bj: p x 6; D,: p x p) which by Theorem 3-4 is | D,|° = (11 r) ; 
i=1 


Since the value of B,, does not affect G, we can, for determining G, write X,, = B,, and 


X33 = By. + Bj,. Therefore 
p é at 
(E) DB; (ti) Dg; Dy = D(Q% T*; Xzg, Xqp)- 
i= 


Computation of D,. The connecting equations between By», By; Qi, T'*; Xos, Xgg are 


m]-e-(2] 


Xo = Bis, 

X55 = Bry + Bry. 
As previously stated, B,, and B,, cannot be considered as new variables. We therefore do 
not have any direct equations between the variables of X,,, X,, and Q%, 7'*. It is clear, 


however, that the elements of X,3, X,, are linear functions of the elements of Q%, 7'*, and 
hence we can apply Lemma 4:3. 
Let W = (w,;): (6x p) and 8: (6x4), where S = (¢,;8,;) (i,j = 1,..., 6); 6g = 1 if t =]; 
€y = dif i+). 
Let trH = tr(WX,,+5Xz3) 
= tr(WBy.+SByy+SByq) = tr (W By, + 28Bgq) (by 5 A. 12); 


~ (orn [ 52) 


“ tr ((W 28) K 7 |). 
where K = Q-1. 
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Writing Q%, = (4,) (i = 1,...,93 j = 1,...,8), T* = (ty) (i<j = 1,..., 8): 

By spd Ars | 

Wy, «+ Wy Ms Ci a eee eee) aes 


(4:3-19) trH=<tri°" °" ve 83g 28g ... Sag K Pe Bos 


Wy, +e Wp i Sy eee eee 2855 








. wee T 33 


é a 
HiséxéandtrH = Y/h,;. h,;is found by multiplying the jth row of (W 28) into K and then 
j=1 
q™ 
‘ Bs) 
By3 


* 
multiplying the resulting row vector into the jth column of 7 |: Therefore 


é 
trH oa Mn eee W5jp 851 eee 285; eve 833) K 
= Ti 








E 73 
We now partition K so that the partition is a function of j: 


KY RG ; 
=L KY ral (1 <j <9), 
(KY: (p+j) x (p +i); K&: (p+j) x (8—J); KQ: (6-3) x (8-3). 


Now we can write 


(43-20) Qi=K 














f Bas) Pay] 
: @) Bos ) Bos 
(43-21) trH = De (Wy -- Why 85 «+ 28;;) KY + (85 544 --- 839) KX = 
7 Ty Tj 
SG ty i L 753 4) 








By Lemma 4:3, D(Q%, 7'*; X.3, X,3) is the determinant of this bilinear form. Writing the 
terms for j = 1,2,...,d in order we find that the discriminant has 0’s above the main 
diagonal, and the product of the terms in the main diagonal is 





j 
(4:3-22) 2| KQ|2| KP ...2| KP] = 2° 1 | KPI; 
j=1 
| Aa Rapa 
| K2| = | in ; 

| Rpts = Kpsspss 
where the elements k;, are the elements of Q-!. Let A,, be an element of the adjoint of Q. 
Then k,, = ra A,,. Therefore 

' 1 Ay ava Ay 43 | 1 | Up+5+tpt+s+1 -*+ Up+j+i,p+e 
Apts --» Apts psy | Yp+8,p+i+1 + ~~ Up +8, p48 














hen 


the 


ain 
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by 5A.11. But Q= oa = (Qi: PX P; Qi: px 8; T’: 8x8). 
21 


Therefore Ip+k+ip+htt = 'e4t,rsi 
Therefore 


(4:3-23) | KQ| = ral 


bysaj41 id bine 








OU tay 
On substituting this result in (4-3-22) we find 
(F) De = D(Qh, T*; Xos, X9) = 2 TK = 2°|Q|-*t,,.4, ... 5". 
We now consolidate the results from equations (A) through (F): 
J = 2p+4| P|att| Q ptt (11 ti) (i ri) i (ri—r}). Q.E.D. 


THEOREM 4-4. The J of the transformation 
(4-4-1) 0 =|V\ 7, 
where V: px pis p.d., is D(V; 0) = (p—1)| V |toe-, 
Proof. Take differentials of (4-4-1) and let U* = dU, V* = dV; then 


(4-4-2) 0* =|V|tr(V-V*) V7 -| V | (V27*7), 
using 5 A.13, 5 A.l4and 5 A. 15. 
By 5 B.3, J = D(V*; U*). 


V, and hence V-1, is p.d., so by 5 A. 16 we can facter V-! into 77”. Substituting this for 
V-— in (4-4-2) and pre- and post-multiplying the result by 7'-! and 7’ respectively, 
(4-4-3) T20*T'> = | V | tr(1’V*2)I-| V| (1 *2). 

sig Z =T-0*T Z 

sip Z=T7U*T" (2: pxp), 
ali fa = T'V*? (W: px-p). 


By Theorem 3-7 and Corollary 3-7 


D(O*; Z) =|T |, 
(4-4-5) Pe. i? we 
Putting (4-4-4) in (4-4-3) 
(4-4-6) Z=|V\(trW)I-|V|W. 
Then D(V*; 0*) = D(V*; W) D(W; Z) D(Z; 0*) (by 5 B. 2) 


= |T |p DW; Z)|T pH 
=|V|-° DW; Z). 
Computation of D(W; Z). Let 
(4:47) S=|V\|W (8: px>p). 
By Theorem 3-15: D(W; 8) = | V |#e@+, 
Substitute (4-4-7) in (4-4-6): 
(4-4-8) Z = (trS)I-S, 











358 The Jacobians of certain matrix transformations 


whence we find 





Pp p 
(4-4-9) 24 = 2D 8y— 8 = LX Sy 
p=] v=] 
vi 
02; 02,5 02; 
Therefore —* = 0; a =1 (t+)); a =0 (¢+)). 
r Tau aay (t+)) 28,, (¢+)) 
ay = Si (2 <)); 
024; 
and therefore =" =—1, 
C8; 
The scheme of coefficients is 
85; Sij (i <)) 
(4-4-10) Z4% H 0 
Zy (t<J) 0 id 
the determinant of which is 
0 ] 1 iT 
(4-4-11) a io 1 0 2 a. = D(S; Z). 
1 1 1 0 


Let h be a column vector, px 1, consisting entirely of 1’s. Then hh’ is a px p matrix 
consisting entirely of 1’s, and D(S; Z) = | hh’ —J, |. Using 5 A. 8 we get 


Consolidating the results 
J = D(V*; U*) = D(V*; W) D(W; 8) D(S; Z) D(Z; 0*) 
= | V |- +) | V |:v@+D (ym — 1) 
=(p-1)|V [e+ve-2, Qz.p. 


CoROLLARY 4:4. The J of the inverse of the transformation of Theorem 4:4 is 








D(U; V) = we. | U |e +D@-22@-2), 
p-l1 
Proof. |\T|=|7| |Vj7A=|P 7 
and therefore |V| =| 0 |w-», 
DU; V) = 2 —T . | V |-@+D@-2) 
; DV;U) p-l 
= a | U —-(p +1)(p—2)(2(p—1)) Q.E.D. 
p-—l 


THEOREM 4:5.+ The J of the transformation U = T2(X +I)-TJ] is 


A p-- i . A A 
D(T, X; U) = 2-0 ( Ty n-*) |X+7|-9-) (X,7,U: pxp; Uns). 
i=1 / 
This transformation is unique if | 7’-1U +7 | +0 (see 5 A. 18). 


+ This transformation has been used by P. L. Hsu in unpublished work on the distribution of roots 
of determinantal equations under certain non-null hypotheses. 























ee ee 





a 
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Proof. Since X isskew symmetric it does not have a characteristic root — 1, so | X41 | +0. 
By 5 A.17 and 5 A.19 we can write UU’ = T7". Therefore T-UU'7"— = I, and hence 
I = TU is orthogonal. Then 
(4-5-1) U=7T =7xX+1)-I] (by 5 A.18). 


Take differentials of (4.5.1) and put U* = dU; 1* = d?; X* = dX. Then using 5 A. 20 
we get 


(4.5.2) U* = T*P—-4T(P 41) X*(P 4D), 
and by 5 B.3 
(4-5-3) J = D(T*, X*; U*). 
- W =T0*T", 
(4-5-4) fF =T-T*, 
Y = ((04+J)X*(1" +1). 

Pre- and post-multiply (4-5-2) by 7-1 and I’ respectively and substitute front (4-5-4). Then 
(4-5-5) W = R-4Y. 
By 5 B. 2: 
(4-5-6) D(T*, X*; U*) = D(T*, X*; R, Y) D(R, ¥; W) D(W; U*), 


which by 5 B. 4 becomes 
= D(T*; R) D(X*; Y) D(R, Y; W) D(W; U*). 
From (4-5-4), using Theorems 3-6, 3-8 and 3-10, 
D(W; U*)=|T |? (since || = 1), 
PD 
D(T*; R) = Ita’, 
i=1 
D(X*; ¥) =|P4+2 |e. 
From (4°5:5) Wye =Tys Wy=—Wy (t<J); 


Wy =Tythyy (t<J). 
The scheme of coefficients is 





Tn Yas (<9) Tig (@<)J) 
Wi I 0 0 
wy (i<j) 0 -4 0 
Wi; (t<J) 0 41 I 
Therefore D(R, ¥; W) = 2-470-», 


Hence, consolidating the results, 
p-1 ; 
J = 2-*p@-1) ( -*) | T+ |>-2, 
1 


i= 


By 5A. 19, | T+J \p-1 = 2pp-1) | X47 |-», 


p-1 A 
and we have J= zino-0/ Il a‘) |X+J|-?-», Q.np. 
i=1 
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THEOREM 4-6. The J of the transformation 
5 = (2(X+1)2-1D, {2% +) -1) 


is D(X, 0; 3) = 20-0) $47 [ool (0,—0,) (8, Dp, X: pxp). 
Proof. By 5 A.19 we can write 
(4-6-1) S=TD,!I". 
Let S* = di, 
(4-6-2) FS = dX, 
D, = 4Dz. 


Take differentials of (4-6-1) using 5 A. 20 and substitute from (4-6-2): 
(4:6°3) 8* = -}4(4+]) R*(P 4.1) DW’ + 4D, (I" +1) X41" +1) + rD,!". 
(Note that x’ = ne By 5 B.3 


(4-6-4) J = D(X*,; 5*). 
Let - . 
(4-6-5) Y= sr, 


¥ = (41) X*(P4+0). 


Pre- and post-multiply (4-6-3) by I’ and [' respectively and substitute from (4-6-5): 


(4-6-6) V=-3YD,+4D,¥+D,, 
whence from 5 B. 4 
(4-6-7) D(X*, 9; 8*) = D(X*,9; ¥,9) D(¥,9; V) DV; 3*). 
From (4-6-5), using Theorems 3-7 and 3-10, 
VY. Xe) — 

(4-6-8) mes ot dee 

D(&*; Y) = | T+ )P3. 
From (4-6-6): Me = es Mey = BOY — 49545 = HO-—9) 45 (6<))- 
The scheme of coefficients is Ms Ys 


v,| 1 0 
vy |0 H’ 
where H is diagonal, with elements 


(9, — 9s), ery +(9, —9,), $(9,—93), sey +(0,_,—9,), 
a total of $p(p— 1) terms. Therefore 


(4-6-9) D(¥,9; V) = 2-6-9 T] (6,—4,), 
i<j 

and 

(4-6-10) J =|T41|r12+470-» TT (6,—6,); 
i<j 


whence, using 5A.18, J = 240-0| X47 |-0-0 7] (6,-6,). Qu. 
i<j 
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Note. The following comments relative to the uniqueness of the transformation of 
Theorem 4-6 have been made by P. L. Hsu in a communication to the Editor: 
‘THEOREM. The equation 


= [2(1 + X)-1 D,{21 +X) - 1)’ 
establishes a (1, 1)-correspondence between S and Xx , provided that 


(i) S does not belong to a certain set of symmetric matrices which constitutes a set of 
measure zero (in the $p(p+ 1)-dimensional space), and 


(ii) the elements (1, 2), (1, 3), ..., (1, p) of the matrix (I +X) are all positive. 
Remark on the above theorem. The condition (ii) on z may be put in a simpler form as 


follows: if we write 
A 0 - ), 
(° Y 


then the row formed of the elements (1, 2), (1, 3), ..., (1, p) of (+X) is equal to 
#(I+ Y)- 
14+2%(1+ 2)72’ 
Thus an equivalent to the condition (ii) is the following: (ii), every element of the row 
%(I+ Y)~* is positive. 
As to the proof of the above theorem, which is not very short, I intend to give it in a 
paper to be published in one of the journals issued by the Chinese Academy of Sciences.’ 





5. APPENDIX 


A. Certain matrix theorems 


The following definitions and results from the theory of matrices and determinants are 
stated for reference purposes. 


5 A. 1. oor yt of the ith aad of a matrix by a constant a+0 may be accom- 
plished by ? pee » p Multiplication by a diagonal matrix D, with a in the ith position and 1’s 


elsewhere. | D| =a; D- is a matrix of the same type as D (see Frazer, Duncan & Collar 
(1938), § 3-11). 

(jth row) ith row 
recs gly pho (ith column) , jth column 
containing 1’s in the main diagonal, a in the ith row and jth column, and 0’s elsewhere. 
| #| = 1; Z-‘is a matrix of the same type as EZ (see Frazer et al. (1938), § 3-11). 

Any combination or succession of a finite number of operations of types 5 A. 1 and 5 A. 2 
is called an elementary transformation. 

5 A.3. The rank of a matrix is unchanged by an elementary transformation (see Bécher 

(1938), § 19). 

5 A.4. Canonical forms. A px p autels, A of rank r can be reduced by an elementary 
transformation to a canonical form, i.e. to a diagonal matrix with 1’s in the first r positions 
and 0’s elsewhere. The reduction to a diagonal matrix of the characteristic roots can be 
effected by an orthogonal transformation (see Bécher (1938), p. 280). 


a+0, a0 multiply by a matrix Z 
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If the matrix A is p x q, p <q, of rank r, there exist two orthogonal matrices [': p x p and 
A: qx qsuch that T’AA = (AO), where 


A, 0 
a-| *s (p xp) 
0 A, 


and A? ... A? are the characteristic roots of the matrix 4A’. Notice that p —r of the A; are zero 
(see Hsu, Quart. J. Math., Oxford Series, 17 (1946), 162). 


Proof. AA’ is non-negative (see Albert (1946), p. 64), and hence its roots are non-negative 
and can be written A?... A?. The rank (.4) = r and hence there exists an orthogonal matrix [' 


such that x2 0 
‘AAT = Se = A? 
0 Az 


of the theorem (see Cramér, 1946, 11-9). Since we can rearrange the position of the A’s 
in A by an orthogonal transformation we can assume 


Ay 0 : ' 


»_|o 2 _ TAR 0 
MS = 0 ae © 
0 . 








0 
and | A, | +0. é i 
Write [ = (I'",T,), where [,: p xr; T,: px p—r. Then 


J , ’ ’ ’ 2 
A2=lI°AAT = i] AA'(T,T,) an PAA rT; [AA rr, Aj of 


T,AAT, TAAT,| LO 0}’ 
(5 A. 4-1) I. AAT, = A3, 
(5 A. 42) lr, AAT, = 0. 


Pre- and post-multiply (5 A. 4:1) by Ay?: 


(5 A. 4-3) Ay *T, AAT, Az? = £. 
Let A, = A'T, Az? (A,: qx7r) 

and then 

(5 A. 4-4) 4, A, = AT;, 

(5 A. 4-5) AjA, =, (by 5A. 4-3). 


From (5 A. 4-2) (TA) (TA)’ = 0 and hence [, A = 0. 

From (5 A. 4-5) we see that the columns of A, are orthogonal and hence there exists a 
matrix A, such that 
(5 A. 4:6) A = [A,A,] 


is orthogonal. The matrices [' and A satisfy the theorem, since on performing the multi- 


plication we find I’A =(A0)A’.  Q.E.D. 














——— ee 
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5 A.5. An elementary transformation on a matrix B is equivalent to Z, ... Z, BE, ,, ... Ey» 
where some of the Z’s are of the type 5 A. 1, and the remainder are of type 5 A. 2. Hence if 
B is square and n.s. we can write (by 5 A. 4) 


B = E," ... 3; 'E,,' ... Bot; 
om FF1--- Fy 
where the F’s are of type 5 A.1 or 5 A. 2 (see Frazer et al. 1938, § 3-12). 
5 A.6. Diagonal matrices are commutative (see R. G. Frazer et al. 1938, § 1-6). 
5 A.7. If A and D are square, not necessarily of the same dimensions, and | D | +0, then 





: DI | D||A-—BD-~C| (see Hsu, 1941-2). 
Proof. 3 5|- (4 at. "|- fer® ~BD-“C | 
|c D Cc D Dc I 
=|D||A-BD“C|. qx. 
5A. | 1+2zy'|=1+9'% (see Aitken, 1939, § 37). 


5 A.9. If Zis n x 1, there exists an orthogonal matrix ’: n x n such that z’T = (0, ..., 0, a), 
a>0, with a = Oif and only if %’ = (0,..., 0). 

Proof. If %'+(0,...,0), choose [ as any orthogonal matrix with elements in the last 
column 


L —— 


i=l 


n —4 : / 
xfs 4) (S=41,2.., 8): 


Then 2’T = [o.....0.(% x) |. If z’ = (0,...,0), any orthogonal matrix will do, and 
.1 


a=0. Q.E.D. 

5 A.10. If M is any mx m matrix, there exists an orthogonal matrix A: m x m, such that 
MA is triangular with 0’s below the main diagonal, and with diagonal elements > 0. 

Proof. By induction. If M is 1x 1, then A = +1 will do. Assume true for a matrix of 
dimensions (m—1)x(m—1). Take [' so that the product of the last row of M into I is 
(0,...,0,a@),a>0 (by 5 A.9). Then 


MT = ry 1 (M,: (m—1) x (m—1); 6: (m—1)x 1; a: 1x 1). 


By the hypothesis of the induction there exists an orthogonal matrix I’, such that MT, is 
triangular with 0’s below the main diagonal. Let 


Tr, 0 
ser[™ 
4 r, 6 by M, b|[T, 0 “< e't ‘| 
Then ma = ur, =['o ig |= 0 al’ 


which is of the required form. A is the product of two orthogonal matrices and hence is 
orthogonal. Q.E.D. 

5 A.11. If D, is the adjoint of any determinant D, and M and M, are corresponding 
m-rowed minors of D and D, respectively, then M, is equal to the product of D™~! by the 
algebraic complement of M (see Bécher, 1938, § 11). 
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5 A.12. If A: mxnand B: nx m, then tr(AB) = tr(BA). 


mn nm 
Proof. tr (AB) = 2, > 3 5 = p> > by Ay => tr (BA). Q.E.D. 
i=1j=1 i=1j=1 
Derrnition 5 A. 13a. If f(2,,...,2,) is a differentiable function of x, ...z,, then 
df = 3 5 of - 
is called the differential of f. im18 


Deriition 5 A. 136. If A = (a,;) (¢ = 1, ...,p;j = 1,...,q), where the a,; are functions of 
(a, ...%,) = ;,(2,...2,), then 


dA = (da,;) (t=1,...,p;j =1,...,g). 
5 A. 13. d(AB) = (dA) B+A(dB). 
Proof. d(AB)= a> 45554) = (2 da. 4355x)) 


= a l(da ij) Oj, + %j(dby,)] = (dA) B+A(GB). Q.E.D. 


5 A.14. If A is square, n.s.,d|A| = | A|tr[A—(dA)]. 





Proof. Let A- = (a4). 
(5 A. 141) ai = (cofactor a,,) | A |-, 
—0|A| si 
(5 A. 14-2) d|A|\=>y> da,, (by Definition 5 A. 13a). 


a i] 0a,; “ 


2| A| 


(5 A. 14-3) We = cofactora,; = | A |4 ait (by 5 A. 14-1). 


Substitute (5 A. 14-3) in (5 A. 14-2): 
(5 A. 14-4) d|A|=|A|>Saida,;. 
ij 


The (i,%) term of A~\(dA) is Ya" (da,,), and tr A“(dA) = Y Ya" (da), which when put in 
(5 A. 14-4) gives the result of ‘nd theorem. Q.E.D. mn 

5 A.15. If A is square, n.s., (dA!) = —A-(dA) Am}. 

Proof. d(AA-") = (dA) A-!+ A(dA“) = 0. QED. 


5 A. 16. A p.d. matrix V: p x pcan be factored uniquely into V = 17", t,,>0(i = 1,..., p) 
(see Mahalanobis, Bose & Roy, 1937). 


5 A.17. If U: px pisn.s., then UU’ is p.d. (see Birkhoff & MacLane, 1947, p. 246). 
5 A, 18, If is orthogonal and | I’ + I | +0, there is a one-to-one relation between I and X: 


P= 2X4+/)1- 
X =26+)7- 
Proof. Show that X is skew symmetric, i.e. X+2' =0. 




















3 of 


p) 
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5 A.19. UU’ of 5 A.17 can be expressed uniquely as: 7'7", where t,,>0 (i = 1, ...,p). 
If | 7-U + I | +0, U can be expressed uniquely in the form 


U = TX +1)>-7). 


Proof. Follows from 5 A. 16, 5 A.17 and 5 A. 18. 
5 A.20. Given I asin 5 A. 18, then 


aV = —}(P +1) (dX) (P+). 


» War % 
5 A.21. Let Tait” 
Vn Vo 
be p.d. (Vi;: px p; Vio: pq; Vos: ¢ xq) (p <q). Then there exists a matrix G of the form 
G, 0 
¢ ‘ad (Gy: px p; Gg: qx) 
— 4 s 
such that GVG@ == [3 \ 
0 4 


where D,: p x p has elements 7, ...,7,, and rj ... 7}, are the roots of the determinantal equation 


| Vis Vog'Ve, — 9V,,, | = 0 (see H. Hotelling, 1936). 
Proof. Since V is p.d., V,, and V2. are of rank p and q respectively, and hence by 5 A.4 
there exist two non-singular matrices F, and F, such that 


(5 A. 21-1) EVE: = 1, 
(5 A. 21-2) F,V,.F; = I,. 
Let A = F,V,,F; then by 5 A. 4 there exist two orthogonal matrices I’, and I’, such that 


i *¢ 
(5 A. 21-3) PAM ef ns) Of, 
0 A, 


where A}, ..., A2, are the characteristic roots of AA’, i.e. roots of | AA’—AI| = 0. 
Pre- and post-multiply (5 A. 21-1) by I’, and I’; respectively, and pre- and post-multiply 
(5 A. 21-2) by T, and [3 respectively. Then 


(5 A. 21-4) LAY Fil; =". Zl%= 5, 
(5 A. 21-5) lla Fil; = 0,27; = ZL. 
But AA’ = FV, F3 FF}, F;, and from (5 A. 21-1) and (5 A. 21-2) 


Saas V,-FOF and Vz=FiR. 


enor AA'—AI = Bhai ~AF FF 
- Fv. Vay —A V3] F;. 


F, is n.s., so | AA’—AI| = 0 is equivalent to | Vj. V4'%, —AV,,| = 0, and hence Aj, ..., Aj, 
are the roots of this last equation. Hence 


a 
(5 A. 21-7) 0] =(D,0). 
0 A 


24-2 
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Let G, = TF, and G, = TF; then G, and G, are the matrices of the theorem, since by 
substituting these values in G and multiplying out GVG’ we get 


AVP, TAM PT: 
PMP, AV Pls 


which by (5 A. 21-4), (5 A. 21-5) and (5 A. 21-7) is 


ay ae 
[3 pt |: oD 


(5 A. 21-8) GV@ = 


0 q@ 


B. General properties of Jacobians 
1 
5 B.1. 2 Te : , ; 
D(Y; X) Dx: ¥)’ if D(X; Y)+0 


5 B.2. D(X; Y) = D(X; U) D(U; V) D(V; Y), where U and V are any functions of X 
and Y such that none of the terms on the right vanish. The extension of this to more variables 
and to the introduction of functions of U and V is immediate. 


5 B.3. Given a matrix transformation, linear or not, Y = F(X). Then the transformation 
of the differentials, dY = dF(X), is linear in the differentials, and the J of the original 
transformation is the J of the linear transformation in the differentials. 


Proof. Y = F(X) implies a set of transformations in the elements of Y: 


Y, = F(z, ...%_) (6 = 1,...,m). 


Let (dy)’ = (dy,...dy,,); (dx)’ = (dx, ...dx,). 
Then the equation dY = dF(X) 
is of the form (dy) = G(dz), 
of; ge ge 
where G= ze | (é,j = 1,...,m), 


and G is by definition the J of the original transformation. Q.E.D. 


5B.4. If X = F(U) and Y = F(V) are a transformation from variables X and Y to 
new variables U and V, then 


J = D(U,V; X,Y) = D(U; X) D(V; Y). 


We wish to express our gratitude to Prof. Harold Hotelling for his help and encouragement 
in the preparation of this paper. 


We are grateful to Dr Ralph A. Bradley for the use of his lecture notes. 
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TWO APPLICATIONS OF BIVARIATE k-STATISTICS 


By M. B. COOK 
University College, London 


1. In a previous paper (Cook, 1951) the derivation of bivariate k-statistics and product 
cumulant formulae up to weight 6 was discussed. In the present paper these formulae are 
used to find approximations to the moments of the sampling distributions of the correlation 
coefficient and the regression coefficients under the very general assumption that the 
bivariate surface generating the samples has as many cumulants as desired. 


2. The sample correlation coefficient. It is assumed that there are two correlated variables 
z and y. A sample of n pairs of observations (x,y;) is available, whence the sample correlation 
coefficient is commonly written 


__2(z-%)(y-9) 
[Dez L(y—y 


the summation in each case extending over n. This expression is, in terms of bivariate 
k-statistics and population cumulants, 


ky Ki ( ky — “u) ( kao — <a)" ( kos — <a)" 
= = 1+ 1+ 1 +808) l 
(Koo keo)* (Kao Xo2)* Ky K20 Ko2 ”) 
This expression can be expanded as the product of two binomials, a process which, it can be 


argued, will be valid in all but a very small proportion of cases provided n be large. Expanding 
and taking expectations we have 


&(r) = {1-3 1 K(; )n3 K(; +3 2 +3 3 1 22 
OV Skike A 0 oe +e 0 2) * 8x3, “lo 0 
tat, +3 1 Koo) ta 1 Se ee 1 120 
8x2, \2 2) 8x,,K%, \1 0 0) 8x, K2, \1 2 2) 4k KagkKoq \1 0 2 
Me 1 (220 _3 1 eee alo oo Tea"(s 3 2 
16 K3Kyp \0 0 2)  16KyK3, \0 2 2) 163, \0 0 0) 16x3, \2 2 2 
sat 0) "(0 0) ~T8mae% 41 2) *(2 2)~8 sate ao 
—— 10 16K, «3 02 8 Ki K50Koe 
Smaebea "(0 0) “(i 2) ~saammrds“(i 2)“(0 2) “i ov ee 
eee 00 8 K11 Ko Kee ea 22 
15 «(5 (6 3) +33 1 K(, «(3 ) +a S. 4 
ax 00 32 Koki 0 2/ \2 2 64K2,K2, 
0 
2 


ea (3 ) +108 Ft (2) 4106 at a(° 9) . 
32«2,«2, \0 128K4, \0 0) * 128K4, 2)’ (2) 














x 








ate 
(1) 


| be 
ing 


(2) 
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where p is the parent correlation. Substituting for the cumulants we have on reduction 


vier) = of ; nit (S rent) eke (se tet) at (5 5) a 


~ Qn 82" \8n 16n?) 3) \8n- 16n2) «2, \2n 4n®) Ky Kop 











-(5- a3) Kis +(a+ea) Kes (A 3) mh 58 Keo 5) Kae 
2m 4?) Ky Kg, \4n 8?) KagKo. \2m 4?) KopKyp 160? 3. 160? K3, 


2 2 2 
— Mae, P Mm 3, Mes cB Kp cc 9 Mahe 9 Kyte 3 i 
2 2 22 ix x9 22 ee | 2,2 
4n®K3, 4n? Ki. 16m? K3gKo. 16M* Kygk2. 40? K2)Kop 40? Kop K2, 47 K35 Koo 

2 
3 Kis 3 Ks 3 Kis 3 KgoXo1 , 3 KosKie, 1 K33 


—— - + — — — 
2 2 2 2 2 2 2 2 572 2 2 
4N* Koy Khq 80° Ky) K3q BN? Ky, Kp 2N* Ky K3q 2M? Ky Kop 4N* Ky Kay Kon 














2 5 of Be 2 
AT ky Key 105 Kay 105 Koy 15 KggKog § 15 Kogkog 15 Kay k?y 


N®Ky,Ko9Koq 128n* Ki,  128n? KG, 32N7 K3yKog 32M? Kygkgo 16? K3) Kyo 

















2 2 KA 2 

15 Koskiy 9 Ke , 9 li 9 Keekt 9 KyoKoa 15 KyoKgy 
2 3 Ont x2.Ke 242 42 2 2s acs 2,2 x2 2k Kk 

16N? Koo Kpq «32N* Ki.K5q 8M K3gK5. 87 KigKiy 64m? KZKR. 16n* Ky, Ko9 











_ bb KosKig_ 3 Keokig 3 Keeks, 3  KgiKog 3 KisK40 





—B_S 3) 
2 2 2 2 2 2 2 2 2 r ( 
161? ky, Keo 8M? Ky, KogKep 807 Ky K3yKoq 16m? Kk, K39K gp 160 Ky, K3y Koo 





‘Again by expanding 


pr(i+ =n)’ (14-2) (14-0 Se)" 


Ky K20 Koz 
and taking expectations, we have, to order n-?, 
, 3. 3 1 5\«x ‘il 8y« 1. ie 2 -4S\. xe 
pier) = e*[1-5 454 (5-3) t+ (2 -S) a+ (+) 2 +(--3) AL 
(on n\n 2) Ky \n 2) 2g \n * n?) Kap Kg \n 2?) Kay Koo 
a (20 2) Ae 1 £9 12 *_4(°-5 Kon, (E , 1) MaeXes 7 Mag 1 Kes 
n® 1) Ky,Keg: \n® n).Kyy Keg \n_. n8Ke, \n., 2) KE mt Ky on? 


2 2 
4K3 403 1 Kap 1 Kyg — OK Kg, 16K Kg 4 Ky 


2 2 2,2 2 2 2,2 2 2 2,2 
n> K3g 1? Kig N KigKog 2 KapKoe2 N° KigKon =” KagKig 1 KigKoe 




















2 ed . 
4 Kip 2 Ky, 2 Kis, 8 KgoKo1 | 8 KosKig 2 Kg BK yeKay 


oO 








2 2 2 2 2 2 2 2 2 2 2 
N"KogKog N° Ky, Koq N° Ky KQg = Ky Kaqg MK Kg = N° Kyy KogKog =" Ky Kon Ke 


: 2 2 2 
1 Ky 1 Koy 2 KgoKig 2 KosKn 2 Kh 2 Kien 3 Kao 


io ae ” 2-2 2-2 gala 2-4 
NK Keg NK Kog = NK Kog9 NK Ko: NK Keg NK Kon = NK 























2 2 2 2 

3 Koa , 3 KaoKae 3 KosKoe , 6 KyoKiy 6 Koakin | 2 Koo 
234 2 2 2.3 Tee aie 

M7 Kog NPKIgKo2 N*Ka9Ko2 ”KgKog MN? KaqKoz 2” Kio KGa 


2 4 
Skok}, 8 Ky A KyoKos 6 KyoKs1 6 Kogkig_ 4 Ko9Xs1 


2,2 © "pte 2 Qed pd 2 a 2 2 es 
N"KogKo9q N° Ka9Ko9g «= NKa9Kog «= Ky Ka 1 Ky, Koq 2? Ky, K3Ko2 























2 
A KogKo2 | 2 Ksikis_ , 1 Kae \ 


st (4) 


2 2 2 2 m2 ee ‘ 
N* Ki Ko. N?KiyKapKo2 Ki, Koo Koa 
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The crude third and fourth moments to order n~? follow in similar fashion: 


3(r) = p(t = 


, 


Ml’) = 


2 
p {1-4 + (5-5) Se E-S) Sor E45) Koo +€-5) Kil 


15 261 | (15 _ 285) Kyo, (15 _ 285) Kos 2 Koo 
2n 8n? \8n 16n?/) Kz, \8n 16n?/K2, \4n 80?) KopKoe 








9 225) x2, (171 9\ ky (171 9\ ws . (3 AM) Kop 
+(5--75 + >= +(—>-—)—* +(- —# 
Qn 4n?) KyKog  \4n? Qn) Kypky,  \4n® Qn) Kogky,  \n 4n?) «2, 


+(; aa) a? 35 Key 35 Kog_ 35 Kao _ La ee 


Kz,  l6n?K3, 16n?K3, 4n?Ki, 4n7K3, 16n? Kip Kyo 161? Koo Ko 


225 KyK3, 225KyKyg 45 KR, 45 Kip 45 Ks, | 45 Kys | 45 Kyo Koy 











2 2 2,2 ~~ ‘Ant 2 2 2" on2 
4? K2yKoq 4M? Ko K2q  4N®K2yKy, 4M KogK2q 8M Ky, K3q 87K, K3, 2M? Ky, K3y 


45 Kook, 27 ky 427 Knkis 9 Ka 9 Ku, 9 Koken 


2 22 2 2,2 2,2 2-2 
2m? Ky Kpq 47 Ky KogKog =? Ky KogKoq §2N* Ki, Koq  2N? KI Kg = 8N* KT Kao 








2 2 
9 Koak20 9 KgoKig 9XKosKn 9 Ky 9 Kip 
2,2 2-2 22 2,2 2,2 
8N* Ki Kog N*KTKoq N° KI Kog = N°KiKoq "KT Koo 











lk 
n 


Ks3 
Kh 


3 KepKig 3 KegKn, Kos, 3 KygKo1 | 945 Kay 945 Koy 315 KyoKoe 


24> “Pt +a5 
Ka, | om? x3, «| «1 28n2Ks, | 128n2 Ks, 32n2 KE, Kop 





1k, 
Qn? «3, an? KB, | Cn? 





315 KogKoe , 315 Kggki, | 315 Kygkhy | 225 Koy | 225 KygQKki, | 225 Ki, 
B2N® Kygk3. | LON? K34Kyg | LON? Kagk3q 32m? K3,K2, 8m? K2yK2, Bn? K2yK2, 














225 KagKoq 315 Kyg Kg, 315 Kygkyg 135 KyQky, 135 KagKyg 


64n2 2 2 2 2 2 
B4n* K3oK52 16m? Ky, K3q 16? Ky, Kjp 82" Ky K3qKq2 8? Ky, Kag KGa 








2 2 5 

135 KyoKi3 1385 Koy Kay 45 K3; 45 Kis 45 K4agKoo 
4n? x2. K2 2x2. x2 22 x2 

16n? Ky, K3o Koo 16n? Ki, KopKeq 40? K2,K3,  4n?K2, K2, 80? K?2, «3, 











45 Kogkog | 27 KgiKig 9 KgiKog_ 9 KygKo | 27 _ Ke | 
2,2 ,2 2,2 2 
Bn? KT, Kog 2N7 Ki, Ky9Kgq 2n : 2n? Ki Koa 4n? K4, KeoKoal ’ 

















2 2 2 2 
n n mn?) K2, \n mn?) Ke, \n 1?) Koko \n Koo Koo 








n n* 


n n2 


nt n 1 Ki, on®Ki,  n?K3, 


+(4-5) K3y +(-5) Kis, +(° a) Koo ( = KeoKo2_ 4 Keo 4 Kog 


2 
Ky1K29 K11Koe2 Kh 





2 2 2 
16k3, 16Kjg 6 Kyo 6 Koy 144K) Kg, 144K, Kg 24 KY, 
2 23 2 2 2 72 2 2 2,2 

N* K3g N* Kig N*K3qKog N*KygKig 1” KigKy: 1 Koo KGq 1” K3gKog 





24 Kip 12 Ks, 12 Kys | 4B Kg9Kay | 48Kogkig 16 Kgs 64 Kiko 











2 2 2 2 2 2 2 2 2 2, 2 
TN" Koo Ko2 2° Ky K99 "Ky Kon 2" Ky Kog "Ky Kog 0" KyyKa9Kog =" Ky Kan Koo 


2 < , 2 
12 Kyo 12 Kag | 6 KyoKor | 6 Kogk29 24 Kg9Kig_ 24Kogko 24 Kh 











2 2 272 272 2,2 2,2 2,2 2,2 
N° Ry Keq UK Kog NK Kog NK Kon NM KYyjKoqg N° Ki Kgg =" Kj, Koo 


§ 2 Bee 
_ 24 Kip 43g 12Ky9Kig_ 12 KoeKgi | 4 KaoKog | 12 Kako | 15 KG 














22 2 2 2 2 2 2 
m? Ki, Kon MK, nm? Ki om? Ki, om? Ki, om KG, K4y 


(5) 


ee 


Koy 
2 
Ko0 


(5) 
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15 kog 24 KyoKo2 24 KggK og 48 Ky Kh, , 48-Kog KF, 18 KR 


ne ne ne 3 ox 
Mn? kby | 1? KEyKoq Kaoko2 1 KigKo, 1? KaoKjg N? Kio Kos 








72 Kk}, 72 Kh 9 KyoKos 48KsoXs1 48KogKig 48 KgiKo9 
272 ,2 22 ,2 2,2 ,2 2 3x. xt 2, 2 
N” K3oKG2 N* K5gKbg N*KigKig 7 Ky, K3q 2 Ky Kbq 0” Ky, K3qKog 











48 Kigkg 24 Kyokis — 24 Kogks, | 36 KG, 36 Kis 18K gg K oe 


2 2 2 2 2 2 2,2 ,2 2,2 ,2 2,2 ,2 
1" Ky Ko9Kog 1" Ky, Ka9Kog "Ky Kon Kon "Ky Koq N° KY Kong «1° KI Kog 





2 6 
18KoyKo2 48 K31K13 24 Ky 24K 3) Kog 24 Ky 3K a9 3 ke 
22 ,2 2,2 2 pe 2 7) 
M? Ki, Ko2  N” KiKagKog NK KagKog > KiyKoq Ki Kgg 24 
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2 pe 
3 K30Kb2 arid 


mk om Kh 
3. For the case when the parent population is the normal bivariate surface 


1 
P(xy) = a | - Xi —p) (x? — 2pay +) : 


Ky = Ko, = 9, Ko =Kog=1, Ky =p and-xk,,=0 for p+q>2, 


and the expressions obtained for the first four moments of r reduce to 





(p) = ft RS oe Fa? 
Ay(r) = pll-5, Bn? +P (s.- a Ad mat (7) 
3 3 2 12 |: ae 8 
te SP ree Ba ee et 
Ha(r) = p {= +ratP (- wa) tala tsa) +P ate (8) 
1/3 9 15 261 9 225\ 225 
, je MP tens 0 ee es ee ee pa a ta a Aen B. 
wate) = (5 mn) +) On * Bn2 +? (5 aan) tage lt (9) 
13 1/6 36 14 129 8 168\ 72 , 
4(r) = p*{——+— [--— ——+—  +p*|--—] + sp") +... 1 
Hal”) Pama t al =) +1 nt mt? ( at) tye] t c (10) 
from which we deduce o%(r) = =(1- p?)? (+5 +—+ =), (11) 
6, 
Hs(r) = -F( —p*Y, (12) 
3 2)\4 
Balt) = a (1—p?)f. (13) 


The expressions for ~, and 4 correspond exactly (so far as order n-*) to those given by 
Soper et al. (1917), which provides a check on the work. 


4. To take formulae (3)-(6) any further (e.g. by using them in deriving a Gram-Chazlier 
expansion for the frequency function of p) it would be necessary to make some simplifying 
assumptions concerning the cumulants. The difficulty is, that in doing so, one imposes such 
constraints on the distribution that it becomes doubtful what significance should be attributed 
to the coefficient r. In order to illustrate how the derived moments may be used it has been 
supposed that the parent population generating the sample is a Gram-Charlier surface of 
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order 3 only, with linear regression and equal marginal standard deviations. Itis not claimed 
that this restricted case will be met with in practice. Writing ry to represent the normal 
parent result 


Hi(0) = wilrw) +P b+ Hb) 0") (— 55+ Tes ea’) 





~ Bn? t 16n3 ~ 16n2” 
KgoX, 1 9 45 
+See 1p) (— Fata t gah) + (14) 
1 5 59 
Ks0Kos (> 57 we) ne ( 2 2 “ook = 
2 Mn — —2=—} +... 


1 ; 9p? 9992 39p* 
fg(7) = fg (7?y) + 2 p(1—p?) {fo +48) (“e- oe oo) 








K3oXo3(_ 3p? 3875p? we 3p* 9p? Kso0 “es) 
~a0%es {| Fy = SP (14) (ic, +2, — 2-2) +, (16 
+ r ( i + a hat K3o + Ko3 r +..., (16) 








1 15 78, 51 
Malt) = Halty) +75 PUL — Pp") {(xfo-+ #83) (— 5 PP 3P) 


+ "2050 (15.4 698+ 123p)} +... (17) 


If the departure from normality is not too marked, then Ky, and K9g ‘vill not be large and these 
expressions should give reasonable approximations to the true moments of r. The hypothesis 
which, in practice, it is usually desired to test is that p = 0-0, that is, we are interested in 
finding out whether there is some association between our two variables. For this case the 
expressions for the moments simplify considerably and we have 


Ay(r) = Kwkwo( a+ gos) Hines (18) 
f(r) = ee (19) 
Hs() esos (20) 
f(r) = ate (21) 
giving Bir) = ( a2 *Resaite on (22) 
A,(r) = aT (23) 


In this case we note that, to the degree of approximation considered, the amount by which 
the parent population deviates from normality does not affect the value of £,(r), and that, for 
reasonable-sized n, £,(r) does not differ appreciably from zero. Some values of /,(r) have 
been calculated and are given in Table 1. The cases are restricted to those for which xj, = ks. 
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It would thus appear that the same Pearson curve which represents the distribution of r in 
samples from anormal parent population will also approximate to the distribution of r when 
the parent population deviates slightly from the normal. Hence a small amount of skewness 
in the parent population will not affect the test of the hypothesis that p = 0-0. 


Table 1. Values of £,(r) when parent population is not normal, p = 0-0 and K3y = Kg. 








x2 
” 0-0 0-2 0-4 0-6 
n . 
20 0-0000 0-0000 0-0001 0-0003 
25 0-0000 0-0000 0-0001 0-6002 
50 0-0000 0-0000 0-0000 0-0000 























5. As far as is known the distribution of the regression coefficients in samples from a 
non-normal population has not yet been investigated. If the regression of y on z is linear 
and, in the population, is of the form 


y= a+ Px, 
then we may write the regression line of y on z in the sample as 
y =a+t+bz, 
: : My _ ky 
where the regression coefficient b=——=—. 
My Keg 


Writing 5 in the form of k-statistics, we have for the regression coefficient of y on x 


¥ (:-3) (Y-9) 
b om i=l 
viz 





5 (,—2)? 
i=1 


ky 

_ 

a (. += *n| (1 + aot)" (24) 
K2o Ku Keo 


which can be expanded as a negative binomial for reasonably large n. Proceeding in a manner 
analogous to that used above to obtain the moments of the correlation coefficient, we have 
to order n-* 


’ Ky 1 2 | 3 k4o\ (Kao a1 4 (Kiyo  KaoKo1 1 (Kéo K51 
Hib ya) = S21 + (2-3 + 5 See) (Se—_Am_) _ © (ie Soon) _ = (ee , (25) 
en Kao \ m mn? K3o) \K3q Keoki) 7 \K3q KinKd0/ 1" \K3g Ky K3 
ref, 8,Ao(2_9) St Oe, Beh, ou (612) , 8 
K¥o n n*® Kio n? K41K39 


N? Kg M K3q 1? K3g Ky Kan 0” 
24 K 39K Fetes *) “ane (- ) 2 Kao, | KaoKoe 


2 2 2 2 ans) nips 2-2 
NM? Ky Kg 2” Ky K9g KI NKYKeq = N° K11 Ko9 











Ha(by\2) mani n Ns 








n n* 





ae 2 2 2-2 pp 22 pe 
NK Keq NK Keg NK Kg) 1K Ko9 





4 KyKig + Kh 3 Kyoko , 6 Kia |, (26) 
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j _ Kh 3 9 Ky (6 30 Kn 9 =) Koo (; a “a KooKoo ( *) 
Hs(Pyiz) = a, at nn Ra, ¢-3) +a (-345 Te, n m2)” xi, nn 


10K 40K% 18 Ks, | 72Ka9Ko, 9 Kap 9 Kyo Koo 18 s0K12 





2 2 ne 2 22 
n* Kig 1 no N? Ky, K3q 0” Ky KGq NgK7yKoq  N7 KG Kyq 0” KI Koq 


18_Kir_ 4 1 Kes | 3 Kaokis_ 6 Konkan , KaoKos 1 | 3 Kaka | 45i0 

mM? KE, Ko KE, nm? Ko? KY Ki, Wn? Ki, 1? K%y 

90 KyoKs1 36 KG, 18K yoKop 9 Kai Ko0) 

n? Kyo n? Ki, Ko WP KEKE, WKY Kao)” 

3 x4 6 15 Kky/10 80 K 16 128) k../6 66 

Hive) = Baa eas) tate (owt) Take) 
+ere (O45) 105 Kio 240 KsoKs1 , OO KsoK 2 | 38 KaoKos 120 K5, 48K 31 Kop 














(27) 





2 2,3 xt nike 3 3 xt oo 

mM? Kg? Ky K3q NP KT Kg MKT Kog = 0” KT K3q NN? Ki Kop 
2 2 

36 Kai Koo, 3 K3e 3 K3oKG2 | 6 KaeKapKo2 20Keq 80K 40 Ks, | 160 Kaka 



































2 2 2 "aii, aha of 2 2 
Me, Ki mK, on Kh n Kh nN? Kg 1? K3g NK K5g  ? Ky K3q 
2 
24 Kyg 48K g9Kig 48 Ko, | 4 Kgg 12 Ka9Kig | 4 KgoKog | (28) 
2 2 2 2 2 2 2 2 2 2 
M* Ki Koy NM? Ki Keg N KI Kog N?KY, N? KY mn Ki oN? Ky 
6. If we assume that the regressions of x on y and y on & are linear, then 
Kook pi = Ky Kpit,0 KoeKip = K11X%0,px1 Where p21, (29) 
and the expressions for the first four moments of b,,,, reduce to 
K 
, 11 
Hy (Oyz) oan (30) 
K 
20 
(bec) Ki,{ 1 3 <0 (7 3 2 Keo, 4K3o 3 Kin Koo (1 4 
PN vial = 2 \ nn Ky \n ~n?) * n?x3, n?x3, n?Ki, K2,\n_ n? 
4. K20Koz Te 2 Kis eye 3 KyoXK22 (31) 
K2, \n m2) n2K2, Koy N2K2,Koq 2? Koy Ky, Koo | N2K2)K2,)” 
1 \” 11X20 11X20 2011 Keg 20K 11 
b _ kh 1 6 X40 _ X22 g Keo 9 Kao _ Kao, KsoXos , Kaz 3 K20Xo4a_ 3 _Ka0Xoa 32 
a Ky KA Ry Ky Ky” KS, RL Kon oo 
20 202~Ci Ky 20 20 ies 11 ll KiiKo2 «= K11 Ka0Ko2 





Kt, K K Kook ee 
BL (b )= — {3+ 6-22 40 —6§-2_6 20 Koko . 9 Kio —6 40“ 22 
4\*ylz Kon K2 2 2 my 2 2 
20% 20 Kiy Kiy 20 Kt K29 





+ 3G gS 5 9 Sts genkyotn (33) 

Kh = Kin Keo 11 11 

to order n-*, 
When the parent population is a bivariate normal distribution, we have further 


’ K o 
Hy (by) =U = p-¥ 


Koo o,” (34) 
i’ tive, @ eS o < 
tale) = (o93) (eat) ~ (nts) OPM (35) 
fg (b viz) ad 0, (36) 
3 (ka «&,\* 3 o4 
Halbye) = 55 (get) = gl PPE, (37) 


which agree to our order of approximation with the known moments of the distribution 
of 6,,,, for a normal distribution. 
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Table 2. Values of £,(b), £,(b) for p = 0-0, x, = x2, 












































































































































n 20 40 60 
‘ 
Kio =| Bx(b) B,(b) A,(b) B,(b) B,(b) B,(b) 
(27) | 
0-0 | 0-0000 3-4000 0-0000 3-1714 0-0000 3-1091 
0-2 0-0002 3-3827 0-0000 3-1279 0-0000 3-1071 
0-4 0-0010 3-3656 0-0001 3-0852 0-0000 3-1052 
Kee 
K. 
; <4 Table 3. Values of £,(6), B.(b) for p+ 0-0, x2, = x2, 
- (a) p = 0-2, Kap = Kog 
20 | 
K5o 8,(b) 8,(b) £,(b) B,(b) B,(b) B,(b) 
(29) 
0-0 0-0000 3-4000 0-0000 3-1714 0-0000 3-1091 
0-2 0-0000 3-3999 0-0000 3-1746 0-0000 3-1120 
30) 0-4 0-0001 3:3996 0-0001 3-1777 0-0000 3-1148 
0-6 0-0019 3-3990 0-0003 3-1808 0-0001 3-1176 
(6) p = 0-2, Kp = — Kos 
ine n 20 40 60 
| | Ko Ay(b) Bx(b) Ay(b) Bb) | Bab) | Bal) 
| 
0-0 0-0000 3-4000 0-0000 3-1714 0-0000 3-1091 
0-2 0-0000 3-3573 0-0000 3-1559 0-0000 3-1001 
0-4 0-0004 3-3141 0-0001 3-1401 0-0000 3-0911 
33) 0-6 0-0010 3-2706 0-0002 3-1242 0-0000 3-0817 
(c) p = 0-4, Kap = Kog 
34) | 
n 20 | 40 60 
35) | 
Kio «=| «SA (0) B,(b) | B,(b) B,(b) A, (6) ,(b) 
36) | | 
37) 0-0 | 00000 | 34000 | 0-0000 3-1714 0-0000 3-1091 
0-2 | 00000 | 3-4062 |  0-0000 3-1765 0-0000 3-1130 
0-4 00001 | 3-4118 00000 * 3-1809 | 0-0000 3-1167 
ion , 0-6 0-0003 | 3-4168 0-0000 | 3-1861 |  0-0000 3-1204 
| | 
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7. For the same double Gram-Charlier surface as before (linear regressions and equal 
marginal standard deviations), we have for the moments 





Hy (byz) Ps (38) 
1—p? 2px. 15) 6 
Pa(byi2) = —f + - (Ko3 — PK 30) (2 ae 4) + 73 X30(Ks0 — PKo3); (39) 
9p 9 ; 
H3(byiz) = 3 <30(Ks0 — PK 3) + 73 Ks0( l — 2p*) (Ko3 — PK 30), (40) 
3(1—p?)2 129? 36 
Ma(byiz) = ros Soe + — (Kos — PK go)” + — (K39 — PKos) (Kos — PX 0): (41) 


The /, and £, of the distribution of b,,, have been calculated from the above expressions for 
various values of p and Ko, K39, and it appears that a small amount of skewness will not 
seriously affect the distribution of 6. Tables 2 and 3 show some values of /, and /, for certain 
values of p, Ks9 and Kg. 


8. It is clear from a study of these tables that we might expect the effect of skewness in 
the parent population on the sampling distribution of b,,, to be slight. However, we may 
really only judge this effect by approximating to the distribution of b,,, in some way (for 
example by a Pearson curve with the correct first four moments) and finding out whether 
the nominal significance level is affected to any appreciable extent. This procedure was 
carried out for each of the above cases considered, and the results show definitely that nowhere 
were the 5 and 1 % significance levels altered enough to invalidate any test of significance 
carried out on the assumption of parent normality. We may therefore reach the conclusion 
that distortion in the way described above of the normal parent population is unlikely to 
lead to error in tests of significance. 


I should like to thank Dr F. N. David and Prof. M. G. Kendall for helpful suggestions in 
preparing this paper for press. 
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INTERRELATIONS BETWEEN CERTAIN LINEAR SYSTEMATIC 
STATISTICS OF SAMPLES FROM ANY CONTINUOUS POPULATION 


By G. P. SILLITTO 
Research Department, Imperial Chemical Industries Ltd., Nobel Division 


1. Statistics which consist of linear combinations of the observations in a sample have 
been called by Mosteller (1946) ‘systematic statistics’. In one class of these the law of 
combination of the observations is always the same for samples of a given size. It is evident, 
therefore, that linear systematic statistics of this class must be linear combinations of each 
other. Their expectations will be linearly related, and the variance of one will be expressible 
in terms of variances and co-variances of others. These relations will hold for samples from 
all continuous populations. In this paper, expressions are obtained for the expectations of 
some linear systematic statistics of this class in terms of expectations of others. 

It is found that certain linear systematic statistics which measure dispersion, such as 
Gini’s coefficient of mean difference, have expectations which are independent of the size 
of the sample from which they are calculated, and linear systematic statistics having this 
property are derived which may be used as indices of skewness and kurtosis for samples from 
any continuous population. Statistics with this property have advantages, for instance 
when estimates from samples of different sizes are to be combined. 


2. The expectation of the range, #,, in a sample of n from any continuous population of 
variate-values z, which has distribution function F, is (cf. Kendall, 1943, p. 223) 


ma = [a-a-Fy— Pryde. (1) 
Similarly, D,-4 = fa —(1-F)"1- F"-}dz, 
the integrals being taken over the whole range of z. Therefore 
“= | {(1—F)" F + Fe-\(1— FY} dz + Wy, 


_ (Xn,1 = Xn, n-1)/” - Wn—-w (2) 


where X,,, is the expectation of the difference between the (p+1)th and the pth value 
of z when the sample members are arranged in ascending order of magnitude. It is known 


(K. Pearson, 1902) that s 
Xn,p = (") [ra — F)dz. (3) 


Obviously @,,_, in (2) can be expressed in terms of ¥,_1 1, Xn—1,n-2 aNd W,_2, and so on. 


3. From (3) 


= (*) I Fr-? (1—F)? dz = Xn, p» (4) 











378 Interrelations between certain linear systematic statistics 
and by using this relation repeatedly on the terms of its own left-hand side it is found that 


=o y@ —pt+l 4 3) pt+i1)(n—p+2) 
np — —v)ip! Xn-v, p—v — et ee v+l Xn-v+1, p-v (n—v+ 1) (n— v +2) Xn—v+2, p-v 


1p SPANO pt -. inp ty) 
(n—v+1)(n—v+2)...(n—1)n *mP> 











+(- 
In the special case of vy = p—1, (5) becomes 


ols [Xn—prna _ ct Xn-p+2%1 tT Xn—p+3.1 _ be p-1Xaal 
rao = (5) (Ra (; aes ea Oe Fors pee a es 


4. By using (4) in the right-hand side of relations like (2), other expressions for W,, in terms 
of mean differences in samples of size n and smaller can be obtained; and from such expressions 
one can also express X,, p+ Xn,n—p in terms of mean ranges in samples of size n and less. The 
relation obtained is proved quite briefly in the following way: 


~3 (= 19?) War = [-[B(-17 (7) -2(- 1?) GF e—5(-1y (?) Pee ae 


= [- (0-1-0 Fy-e Fe— Pm — Pryde 


n 
> Pine (Xn, p+ Xn, n—p) (7) 
The form of this expression which arises in a symmetrical population, in which x, , = Xn,n—p> 
was proved by E. S. Pearson (1926). 
For n odd, we have from (7), taking p first equal to 1 and secondly to n—1, 
1! (n—1)! et 
reo (Xn,1 + Xn n~1) = Wy- Wn-1) 
(n—1)!1! nm—1\ _ m—1\ _ m—1\ _ 
ee aan (Xn, n-1 + Xn, 1) = —U,+ 1 Wn-1 — 2 Wn-gt.-. + 1 Wp. 
1 —1\_ —1 as 
Therefore Wn “ D) ((" 1 ) Wr-1— (" 2 )w Wy-gt +--+ (" l ' a, = 30, _1, (8) 


and similarly a number of relations between mean ranges can be obtained by giving other 
values to p. By means of (7) the relation between mean ranges derived by Romanowsky 
(1933) for a normal population can be proved for any continuous population. 

A special case of (8) pointed out by Robbins (1944) is 


Wy = $y. (9) 


5. K. R. Nair (1950) has proposed the symbol j,,) for the difference between the sum of 
the largest r and the smallest r members of a sample of n (1 <r < 4n or }(n— 1) according as 
n is even or odd) which was suggested by Jones (1946) as a measure of dispersion in large 
samples. We shall use ,, j,) to denote this statistic in a sample of n, so that 


nde) = (en tena t +++ +2 ppg) — (Sp +21 +» +2) 


= (2, — 24) + (Zp—a— 2p) + +++ + (pps — 2) 





T_T 
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—p? 
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Therefore 
Ende) or Wy, + (W, a Xn,n-1 bie Xn,1) tot {W, =F | + Xn,1) 
baer s, n—2 + Xn,2) id iti (Xn, n—r+1 * (ee (10) 


By means of (7) the right-hand side can be expressed as a series of mean ranges. The 
coefficients of each of the @,,_, (0<s<r—1) can then be summed, giving 


Einjo) =" (— wy (27271) (") (11) 


r—s—l 


6. Ifz, , is the pth difference between the ranked individuals in a sample of n, the mean 
deviation from the median in the sample is, if n is odd, 


ee : 
mM, = 7 tnt 22n 9+ weet 3(n—- 1) 2, ¥n—v + $(n— 1) Ln n+p + weet 22n n-2 TX, ow 
if n is even 
i 1 
m,, = a itnat Qin gt +e + (4M — 1) Xy gna + Nn an + (4M — 1) Sq gngat ++ + 28y, n-2t+ Fn, n—1}- 


By taking expectations and using (4), it is easily found that 


, , 1 
E(my_1) = E(my) — 2(2k — 1) (12) 
and that E(my;,4.1) = E(m,). (13) 
The latter relation has also been proved by Godwin (1949). Thus we obtain 
A 4 ee 
E (M3441) = (m3) = 2 Or —] Xen” (14) 


which can also be expressed in terms of mean ranges by use of (7). 


7. From the n—1 expressions obtainable from (7) by giving p the values 1, 2,...,n—1 
there arise n — 1 independent equations relating W,, Wg, ..., Wy, ANd Xp 15 Xn, +++» Xn, n—1- _Lhese 
equations can then be solved to derive explicit expressions for W,, (2<m<n) in terms of 
mean differences in a sample of size n. The general expression obtained is 


PO (n—m)!"—™ (n—p)! 
Om = 21 Xnp nm! pai (n—m—>p)! 





(Xa.» + Xan-e> (15) 


which can be verified directly from the integral expressions for w,, and x, ,. Relation (15) 
was obtained in another way by E. S. Pearson (1926). 
If in (15) we put m = 2, we obtain after some reduction 


2 
Ws = Fy) OO Xn t 2-2) Xnat --- + PMP) Xnyt --- + (H— 1)Xn.n-1}- (16) 


The right-hand side can be recognized as the form taken by the expectation of g, Gini’s 
coefficient of mean difference (Gini, 1912), when this is expressed in terms of successive 
differences of the ranked variate-values. 

Biometrika 38 25 
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When the values x, , of the pth successive difference (p = 1,2,...,n—1) between the 
ranked members of a sample of n observations are inserted on the right-hand side of (15) 
instead of their expectations y,, ,, the quantities so obtained may be regarded as statistics 
which are generalizations of Gini’s coefficient of mean difference. We may refer to them as 
the g,, statistics, such that E(g,,) = @,, for samples of any size n >m, from any continuous 
population; thus 





n—1 (n—m)!"—"™ (n—p)! 
Im = Zn n! ie (n—m—p)! (Ln. p + Tn, n—p)- (17) 


The expectations of the g,, statistics are independent of the number of observations in the 
sample from which they are calculated, unlike Jones’s j,,) statistics, or the mean deviation 
from the median. 


Table 1. Variance of estimates of W,, from a sample of size n by methods (i), (ii), (iit) and (iv) 











Sample size n | 
| 
m Method | 
4 6 8 9 10 
2 (i) 0-230 0-135 0-096 0-083 0-074 
(ii) 0-363 0-242 0-182 —- 0-145 
(iii) 0-233 0-143 0-106 0-094 0-085 
(iv) 0-227 0-133 0-094 0-082 0-073 
3 (i) 0-517 0-304 0-215 0-188 0-167 
(ii) — 0-395 — 0-263 — 
(iii) 0-523 0-321 0-238 0-212 0-192 
(iv) 0-510 0-299 0-211 0-184 0-163 
4 (i) — 0-448 0-316 0-276 0-244 
(ii) _— — 0°387 — — 
(iii) — 0-475 0-351 0-314 0-284 
(iv) — 0-443 0-313 0-273 0-242 
5 (i) — 0-582 0-406 0-353 0-313 
(ii) — — — — 0-373 
(iii) — 0-606 0-449 0-400 0-363 
(iv) — 0-565 0:399 0-348 0-308 
6 (i) — — 0-493 0:427 0°377 
(ii) on aes a ssl — 
(iii) — — 0-533 0-475 0-431 
(iv) — _ 0-474 0-413 0-366 





























It is sometimes necessary to estimate the mean range in a sample of m from a sample of 
n (>) observations; for instance, in quality control work or when it is proposed to use 
@,, a8 an index of dispersion. The g,, statistics could be used for this purpose, and it appeared 
of some interest to compare their efficiency as estimators of %,, in the case of a normal 
population, with some other methods. Table 1 shows the variances of estimates of @,,, as 
obtained from a sample of n(>m) drawn from a normal population of unit standard 
deviation, by 

(i) Using g,, as estimator. The variance of the estimate is obtainable from the data in 
Table 2 of Godwin’s (1949) paper, since the estimator is a linear combination of the differences 
between ranked members of the sample. 
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(ii) Dividing the n observations into k subsamples of m and estimating W,, as the mean 
of the ranges in these subsamples. The variance of the estimate is 1/k times the variance of 
that from a single sample of m and can be calculated from Godwin’s data. 


(iii) Taking w,, as the observed range in the sample of n xd,,/d,,. The variance of the 
estimate is d?, var w,/d?.. (d, is the expectation of w/o in a normal sample of n.) | 


(iv) Calculating the standard deviation s and estimating w,, as sd,,. Using the maximum- 
likelihood estimator of the standard deviation, the variance of the estimate of @,, is 
2 a aed ee 
a,| Hn 1 ( rn) 1}. 
K. R. Nair (1949) has investigated the efficiency of g (g, in this notation) as an index of 
dispersion in normal samples, in comparison with the linear systematic statistic which is 
most efficient as an index of dispersion (Godwin, 1949), and found it to be high. 


8. As a linear measure of skewness it appears natural to consider the difference between 
the pth and the (n — p)th difference in a sample of n, or more generally a linear combination 
of such differences. It proves possible to choose a linear combination which, like the g,, 
statistics, has an expectation value independent of the size of the sample, and thus is specially 
suitable as a measure. 

We seek an expression 


n%(Xn, aa a 7 n2(Xn,2 - Xn,n—-2) Tesora a, Cad Xn, ual +... 


(with p = 1,2,...,4(m -1) if n is odd, and p = 1, 2,...,(4n—1) if n is even) whose value is 
independent of the sample size. If it exists it must clearly equal 34,(x3 ;— 3,2). If for all 
the permissible values of p we express X,, » — Xn, n—p in terms of first differences by using (6) 
and remember that ja em . 

X21 = We = FWs = 3(X3,1+Xs,2); 


the resulting equations can be solved for x3 ;— 3,2. In fact we obtain 


3(X3,1— X3,2) = py [ (ene — Xn,n—p) {(*-') 8) , (18) 


the sum being taken over the permissible values of p. This may be verified from the integral 
expression for x, p- 


Thus the statistic n—2\n—2p)\ | /n 
6.=3[ haere 0-2) AUC] 0 


is a linear systematic statistic which measures skewness in samples of n from any continuous 
population. Its expectation 4(x5 ;— 3,2) is the expectation of the difference (median-mean) 
in samples of three from the population. The more familiar measure due to K. Pearson, the 
ratio of (mean-mode) to standard deviation, is not independent of the population, which 
has to be known to determine the mode. 








9. It does not: appear possible to define kurtosis except by reference to some standard 
population. The virtue of the normal population for this purpose is not marked except when 
one uses measures derived from moments, and in considering continuous populations in 
general, the most natural one to take as a standard appears to be the rectangular, which is 
flat throughout its range. The smallest sample which could give an indication of the kurtosis 
is that of four values. In such a sample, an index of kurtosis relative to the rectangular 


25-2 
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population would be 4(x, ,— 2%, .+2,4 3), which has expected value zero in the rectangular 
population. One could therefore take as a measure of kurtosis a linear combination of the 
differences between the values in the sample, which has the same expectation as this 
expression, independently of the size of the sample. 

It is easily seen that 4(y,,—2X42+N4,3) = 5@,—9W, and the mean differences in any 
sample of more than four can be expressed in terms of #, and &, by means of (15). Thus 
a linear systematic statistic for measuring kurtosis would be 


Ky, = 59,— 99>. (20) 


In comparing values of this statistic (or of S,;) one would of course bring them to the same 
scale by dividing them by a measure of the dispersion such as the standard deviation, or 
@, if one wishes to restrict oneself to linear statistics. The expectation of K,; in the case of 
the normal distribution, with unit standard deviation, is + 0-1383, or if the distribution is 
rescaled by dividing by ib it is +0-1226. As another example, the symmetrical U-shaped 
Type I distribution with frequency function 





1 
2) = ——_ (0<z<1) 
is easily found to have 
- 4 12 4 1 
= = (1 =): = 2 2” 
so that K, has the expectation 
48 ,/2 





10 
6(1-=) = —0-07927 or —¥ (1-2) = — 0-09087 
7 7 7 


when scaled in terms of #, or o respectively. 


The author is indebted to Prof. E. S. Pearson for helpful suggestions and advice in 
presenting this paper. 
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THE DELAY TO PEDESTRIANS CROSSING A ROAD 


By J. C. TANNER 
Road Research Laboratory, Department of Scientific and Industrial Research, Harmondsworth 


1. IyTRODUCTION 


Suppose a pedestrian is waiting to cross a road. How long may he expect to have to wait 
before there is a sufficient gap in the traffic? The answer will depend on a number of factors, 
but an answer can be given if we make certain reasonable assumptions about his behaviour 
and that of the traffic. The mathematical theory which arises could be stated in more general 
terms, but for simplicity the application to the pedestrian problem is considered throughout. 
It is thought, however, that useful applications for it may exist in other fields. 

The simplest form of the process may be stated as follows: We have a road in which traffic 
is flowing past a point X, in one direction only, so that 

(i) the number of arrivals in a time ¢ follows a Poisson distribution with mean Nt, so that 
the probability that r vehicles pass in this time is 

e-M(Nty/r! (r = 0,1, 2,...); 

(ii) in non-overlapping intervals of lengths ¢ and t’, the numbers of arrivals are statistically 
independent. 

We assume, in other words, that vehicles arrive ‘at random’. The assumption of random- 
ness is not usually true in practice, but Adams (1936) has shown that the approximation is 
quite good under a fairly wide range of conditions. Pedestrians arrive at X at a rate n per 
unit time, also at random, and independently of the passage of vehicles. Each pedestrian 
starts to cross the road as soon as it can be seen that no vehicle will pass X during the next 
interval J, a time assumed for the moment not to vary from one pedestrian to another. In 
practice, pedestrians often arrive in twos and threes, these ‘arrival groups’ arriving 
sufiiciently at random, and the theory will apply in this case to groups of pedestrians. 
However, some indication of how the theory can be applied to individuals rather than 
arrival groups is given. 

Each pedestrian will be delayed for a time 7', varying from one pedestrian to another. 
Let P(T') be the probability of a delay greater than 7’, and let f(7'), when it exists, be the 


frequency function of 7’, so adP(T 
ft) = -— 


We shall derive for the above hypothetical situation the function P(T7') and the dis- 
tributions of the sizes of groups of pedestrians crossing or waiting at certain times. 

Some of the results are extended to the cases when there are two streams of traffic, when the 
critical gap 7 is not constant, and when we are given the proportion of arrival-groups of 
different sizes. The theory has been applied to estimate the delays which would occur to 
pedestrians and vehicles if ‘advance warning signs’ were erected at pedestrian crossings. 
These are intended to give more guidance to road users as to who has the right of way. 

The results are not all new; the simplest formula for the mean delay to pedestrians was 
quoted by Adams (1936). Garwood (1940) considered a problem connected with traffic 
lights which was mathematically almost the same as this, and later applied the results to 
pedestrian delays for one and two traffic streams. Since this paper was submitted for 
publication, Raff (1951) has derived the mean delay by a consideration of ‘blocks’ of 
vehicles, as in §2-3 of the present paper. He also notes equation (7) of §2-2-1. 
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2. THE DISTRIBUTION OF PEDESTRIAN DELAY (SIMPLEST CASE) 


Three derivations will now be given of the function P(7’), already defined. 


2-1. Garwood (1940) considered a very similar problem in connexion with the operation 
of vehicular-controlled traffic lights. In this, vehicles on the minor road arrive at random 
at a junction with a major road, and if the lights are against them, the lights change as soon 
as there has been a traffic-free gap J at the detector pads on the major road, or after a time 
M if this is sooner. Garwood gave the formula for the probability of a vehicle on the minor 
road being delayed for more than a time 7’. His derivation proceeded from an expression 
for the probability distribution of the longest interval in a line of unit length divided by 
n random points. It is easy to see that the probability P(7’) that a pedestrian is delayed for 
longer than 7 is the same as the probability that a vehicle on the minor road is delayed for 
more than 7' + J, for an infinite value of M, and substitution in his formula gives 





r+1/(_ )8 e—sNI N3(T' — oJ T)s r+1(_ )s e—sNI Ns-1(JT' — sJ J)s-1 
s= * s=1 7 





, (1) 
where r is the integral part of 7'/I. 


2-2. The next derivation of P(7’) is perhaps the most elegant, though it is possibly less 
useful than the third one. It is convenient to change the time-scale so that the unit of time 


is equal to the critical vehicle interval J. Let w and q refer to waiting time and flow on this 
scale, so that w=T)I, 


(2) 
and q= NI. (3) 
It is also convenient to define a= qe~4. (4) 


The symbols P and f will be retained, since this will not lead to any ambiguity. 


2-2-1. A pedestrian crosses immediately if no vehicle is due to arrive in the next unit time, 
and the probability of this is e~?. Hence 
P(0) = 1—e-*. (5) 
If 0<w<1, a pedestrian will start to cross in (w, w+dw) if and only if (i) a vehicle has just 
passed, in (w, w+dw); and (ii) no other vehicle will pass in (w, w+ 1). One expression for the 
probability of this event is —P’(w)dw. (It is easy to see that P(w) is differentiable for 
0<w<1.) It is also the product of the probabilities of (i) and (ii), which leads to 
P’(w) = —a. (6) 
If w> 1, one other condition must be added to the above, namely, (iii) the pedestrian was 
waiting at time w—1. This is independent of the other two events, and has probability 
P(w- 1). Thus we have P'(w) aa —aP(w- 1) (w> 1). (7) 
At w = 1, P’(w) is evidently not continuous, though P(w) will be so. It is clear that in the 
ranges (0,1), (1,2), (2,3),..., P(w) is successively a linear, quadratic, ... function of w. By 
successive integrations the function P(w) can be obtained as 
P(w) = 0(w) —e-76(w — 1), (8) 
a2 3 a’ 
where Ow) = 1—-aw+ = (w—1P— 3 (w—2)+...+(—P > (w-rtlyt..., (9) 


the series continuing so long as w—r+1>0. 


a 








— 
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Equation (8) can, with a change of units, be put into the form of equation (1), based on 
Garwood’s method. 
It can be shown by differentiating (6) and (7), that 


f'(w) =0 (0<w<l), . (10) 
and f'(w) =—af(w-1) (w>1), (11) 
and hence that f(w) = a0(w—1)—ae-*O(w—2) (w>90, +1). (12) 
In the ranges (0,1), (1,2), (2,3),..., f(w) is successively constant, a linear, quadratic, 


... function of w; it is discontinuous at w = 1. 
2-2-2. The moment-generating function (m.g.f.) M(t) of the distribution of the delay is 
given by the Stieltjes integral M,,(t) = expectation of e 
=— [“e aP(w). 
v0 
Using integration by parts and equations (5), (6) and (7), we obtain 
e**(q—-t) 
AD) se ee ae 13 
Melt) = ante (13) 
Expansion of this function leads to the following expressions for the mean and variance of 
the distribution : 











’ @—g—1 
p(w) = —— (14) 
2q_ 9g e9—1 
pao) =§ (15) 
In the original units, the mean delay will be 
NI_ NJ —1 
ay (T) = : waa (16) 


2-3. The third derivation of P(w) is in one sense more straightforward; the solution is built 
up one step at a time. 

2-3-1. Each vehicle passing X ‘blocks’ the road to pedestrians for a unit time before he 
reaches it; no pedestrian can start to cross in this interval. Let k(s)(s = 0,1, 2,...) be the 
probability that a random pedestrian has to wait for exactly s vehicles to pass before he can 
start to cross. This is the probability that the first s time-gaps between vehicles are all less 
than unity, but that the s + 1this greater than unity. This clearly has probability (1 — e~“)* e~*. 
Putting A= 1-e-4, (17) 
we obtain k(s) = A8—Ast+1 (8 = 0,1,...). (18) 
Now, given that the pedestrian has to wait for exactly s vehicles to pass, the distribution of 
his delay, w,, is the same as the distribution of the time before the sth vehicle arrives. This is 
the sum of s independent intervals, each of which has a distribution 

5 et dw 
between 0 and 1. The m.g.f. of this distribution is 


*q 
M -[. rie ete dw 





— 7 
ne waa 1). (19) 
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Hence the m.g.f. of the distribution of w, will be M*. The m.g.f. of the delay, averaged over 

all values of s, is therefore wo 

M,,(t) = 2% k(s) M* 
s=0 


_ ett (q—t) 


eens Seis, 20 
qe4—te*’ (20) 
as before, substituting from (17), (18) and (19). The series converges if ¢ is less than the root of 
te+=a, 


other than ¢ = q. 


It should be possible to obtain the frequency function from this expression by application 
of the Laplace transformation, but a satisfactory method of doing this has not so far been 
found. 


3. THE SIZE OF GROUPS OF PEDESTRIANS 


It is clear that pedestrians will cross the road in groups (actually in groups of arrival-groups, 
though at present we assume that pedestrians arrive singly), the groups tending to be larger 
when the vehicle flow or the pedestrian flow is larger. A proportion e~¢ of pedestrians are not 
delayed, will not cross immediately behind vehicles, and will all cross in groups of size unity. 

We define variates a, #, y, 6 and € as follows: 

a: the number of pedestrians waiting to cross when a random vehicle passes X ; Ag, Aj, Ag, ... 
are the probabilities that « = 0, 1, 2,...; 

#: the number of pedestrians crossing immediately behind a random vehicle (probabilities 
Bo, B,, ...); 

y: the size of a random group of pedestrians crossing (probabilities C,, C3, ...); 

6: the number of pedestrians waiting at a random time (probabilities Do, D,, ...); 

e: the size of group with which a random pedestrian crosses (probabilities Z,, Z,, ...). 

Also let p = nI be the average number of pedestrians arriving per unit time (on the changed 
time-scale). 

We derive below the factorial moment-generating functions (f.m.g.f.’s) of a, 2, y, 6 and e, 
and hence their mean values. The complete probability distributions have not been evaluated, 
but the probabilities of 0 and 1 are given for each distribution. 


3-1. Given that the road has been ‘blocked’ to pedestrians for a time w (at a general time 
this has the same distribution as the delay w), the probability that there will be r pedestrians 


waiting is e-P’ (pw) 


7 (ry = 0, 1, 2, ...). 
The m.g.f. of r will be ; 2 e-Pw (yw) .e* 
. My(t) = py iP 
r=0 r: 
= epwie-1), (21) 


Hence at a random time the m.g.f., M,(¢), of 6, the number of pedestrians waiting, will be 
Ss “ere dP). 
0 


This is clearly M,,(v), where v = p(e!— 1). We therefore have 
et (q—v) 
qe-*—ve’ 


M,{t) sa FM,(e— 1) es FM,(v/p), 


M,(t) = from (13). (22) 





ged over 


(20) 


e root of 


lication 


far been 


e larger 
are not 
> unity. 


pAy «.. 


bilities 


(21) 
ill be 


(22) 


(23) 


——— — hs 


: 
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where F'M,(t) is the f.m.g.f. of the distribution of x with argument t, we have 


_ et (q—pt) 
FEM) = are (24) 
3-2. To find the generating functions of the distribution of «, we note that when a vehicle 
is at X, the road has been ‘blocked’ for a time 1 + w, where w has m.g.f. given by (13); hence 
the f.m.g.f. of « is obtained by replacing w by 1+ w in the above argument. Thus 


M,{t) = My,0(); 
_ e*(g—pt) 
and so FM,(t) = qet—e™* (25) 


3-3. Next, to obtain the generating functions of the distribution of £, we note that r (> 0) 
pedestrians will cross behind a random vehicle if there are r waiting and if there is no vehicle 
in the following unit time. Hence 


B,=eA, (r=1,2,...), (26) 
and since > B= > A =1, 
r=0 r=0 
we must have By = e-* Ag+ 1—e-%. (27) 
Therefore M,{t) = e* M,(t) + 1-4, 
—d— e—pt 
and so FM,(t) = 1—pt “uae (28) 


3-4. The generating functions of the distribution of y may be obtained from those of 
B as follows: C, will be proportional to B, for r = 2, 3, ..., since all groups of size 2 or more 


must be behind a vehicle. Let, then, 
C, = KB, +4, 


C, = kB,, 
C, = «B,, and so on. 
We may find « and y as follows: We have 
1 = K(1—B) +4, 


and tet = ErB, | (ZB, +2), 
i 1 


the second equation expressing the fact that a proportion 1—e~¢ of pedestrians cross over 
immediately behind a vehicle. 


Now By = (l—e-%) +e Ay 
and y | e-Pil+) d P(w) 
ei. 
mas ? M,,( Pp) ~ ge4+ per’ 
e-i— eP 
Thus By = 1+ Pe ett pe 
Also = rB, = 44(8) = p(l—e-*)/q, 
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on expanding M,(t). The two equations for x and yw can now be solved, giving 
pea per+qe4% 
—_—K = Ss ,, 
q ESF (P+D 
M(t) = «(My(t) — By) + ne, 
> pi-—q _te"gqe4+peP 
FE = i+ eer em =6g+p 
3-5. To obtain the generating functions of the distribution of e, we note that 


We finally have, therefore, 


leading to 


E, = 16, Sic, @=1,2)...). 
1 





] f°) 
Hence M(t) = ——~ UG re 
« yobs 
_ 1 dM,(t) 
My(y) dt ’ 
which on evaluation leads to 
(1+t)e-¢-? 








iss dee (pte —qe-a? [p*t? e-P! + qe-4(— 2pt +g + pt? — pqt)]. 


Table 1. Mean values of «, 8, y, 8, €, and probabilities of 0 and 1 





















































} 
2 Mean, % Probability (x = 0) Probability (~ = 1) 
A p(e*—1) e~* (g+P) p e~* [—ge*+e(p? + pg+q)] 
q ge“*+pe? (qe*+pe?}?* 
B p(l—e~*) — er se-t p e~*@ [—q e* +e" p*+pg+q)] 
gq ge*+pe? [ge*+p er} 
y ge* + pe” 0 2, ge +p) 
e-*(q +P) pt+q qe“*+pe? 
3 P (et—q-1) e?-4(q-+p) pq e?-*[e*—(p+q+1) e~*] 
q ge*+pe? [ge*+p e?}? 
2p e?~*[p* e? +g e-*(2p +g +p*+pq)] 
€ 1+— (e¢—q-1 0 a Lt 
q = (qe*+p e?}? 








(29) 


(30) 


(31) 


(32) 


3-6. We can write down the probability-generating functions (p.g.f.’s) of «, 8, y, d and € 
quite simply in terms of the f.m.g.f.’s. For any distribution in which the variable takes 


only positive integral values we have the identity 


p.gf. of z = P(t) = po+tp, +t p,+... 


where p, = probability that x = r. 


= FM,(t-1), 


Te 2 








9) 


0) 


2) 


f= 
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The first few terms of the moment- and probability-generating functions have been 
expanded and Table 1 gives the expressions obtained. Figs. 1(a)-(d) show how the means 
vary with p and q for the ranges of pedestrian and traffic flow 0<p<2,0<q<z2. 
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(ii) €, the size of group in which a random 
pedestrian crosses. 


3-7. The distributions just obtained for «, £, y, d and € can be extended quite simply to the 
case when pedestrians arrive in independent groups, provided the distribution of the size 
of these is known. 

Let g be the number of pedestrians in a random arrival-group, and let M(t) and K,(t) be 
the m.g.f. and cumulant-generating function (c.g.f.) of its distribution. 

Consider, for example, the distribution of «. Suppose that a = r. Then the m.g.f. of the 
number of individuals in this group will be [,(t)]’, or e"*#, If we now average this m.g.f. 
over the distribution of «, we obtain the m.g.f. of the number of individuals waiting when 
a random vehicle passes as 

Aye + A, eXd" + A, cKO +... = M,(K,(t)), 
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i.e. to obtain the m.g.f. of the number of individuals, we replace t by K,(t) in the m.g.f. of the 
number of arrival groups. 


The same rule can easily be seen to apply with £, y, 6 or € in place of a, and the m.g.f. may 
be replaced by the c.g.f. or the f.m.g.f. 


4, EXTENSION OF DISTRIBUTION OF DELAY TO CROSSING TWO STREAMS OF TRAFFIC 


We next consider what may happen when a pedestrian wishes to cross a road in which there 
are two opposing streams of traffic. There are various possibilities as to the gaps in the traffic 
which a pedestrian may wait for; the following are three of the simpler methods he may use: 

(1) He may wait till there is a gap of at least J, in the combined traffic stream, and then 
cross straight over. 

(2) He may wait till there is a gap of at least J, in the near stream, followed immediately 
after by a gap of at least J} in the far stream, and then cross straight over. 

(3) He may wait till there is a gap of at least J, in the near stream, cross over to the centre 
and then wait for a gap of at least J; in the far stream. 

The third method will normally be used when there is a refuge in the centre of the road. 
When there is not, it is not obvious whether the first or second method is a better repre- 
sentation of what occurs in practice. 

We assume that the traffic flows on the two halves of the road are equal, both N per unit 


time, and that Tes li, = I, say. 


These assumptions will usually be nearly true, though others can be dealt with quite easily. 
The second assumption would be true, for example, if a pedestrian waited for a gap in a traffic 
stream equal to, or a fixed multiple of, the time he takes to cross the stream completely. 
The mean delays 4;(7), (7), “;(7) caused by the three methods can be simply obtained 
as follows: 
The first method involves waiting for a gap 2/ in a stream 2, so by equation (16) 


aNI_4NI—1 
pT, = ——S—). (33) 





The second method involves waiting for a gap 2/ in a stream N, so 


e2NI_2NT—1 
/y(T2) = NV ‘ (34) 


The third method involves waiting twice for gaps J in streams N, so 
wim) = Ae), (35) 
The corresponding m.g.f.’s may be written down if required. 

Fig. 2 shows the probability of being delayed for longer than 7’ using each of the three 
methods, for various traffic flows and J = 5sec. Fig. 3 shows the mean delays caused by each 
of the three methods, for various traffic flows and I = 3 and 5 sec. These curves show clearly 
that installation of refuges decreases the delay (since method (3) can be used), and also that 
the mean delay increases much more than proportionately with vehicle flow. However, 
when the flow is very heavy, the assumption of randomness in the arrival of vehicles breaks 
down, probably to the advantage of the pedestrian. 





_— 





the 


nay 


(34) 


—_e 
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The sizes of groups of pedestrians, corresponding to a, /, y, d and ¢ already considered, 
have not been investigated in the case of two streams of traffic, though some of the results 
for a single stream apply directly. 
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than a given time 7’, for the three methods of crossing two streams of traffic. 


crossing two streams of traffic (I = 5sec.). 


5. THE EFFECT OF A VARIABLE CRITICAL GAP I 
It is, of course, not true to suppose that all pedestrians have the same critical gap I; some can 
walk faster than others, and, in addition, there is evidence that an average pedestrian will 
accept a slightly shorter time gap if the next vehicle is fast-moving, though the distance of 
the vehicle will be rather more. 
Assume for convenience that J varies according to Pearson’s Type III distribution, with 
mean J, and coefficient of variation »v, i.e. that the proportion of pedestrians with a critical 


gap J between J and J +dI is 
1 


(J,v®)¥* P(1/v*) 
The mean delay to pedestrians crossing one stream of traffic will now be 
rd oe 1 
0 N heperap 
This may easily be evaluated as 
ae — NI,v*)-¥"—- NI,-1 
V , 


As v0, this expression tends to the value given by (16). 
As an example, if we take N = 360 vehicles per hour, J, = 5sec., v = 0-0, 0-2 and 0-4, the 
mean delays are v= 0:0: 7 = 1-49sec. 
0-2: 7 = 1-57sec. 
0-4: T = 1-84sec. 


PUP-1 e-Ie* dI  (O< I< ). 


[ie?-1 e—liInv? qT, 








(36) 
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6. COMPARISON OF THEORY WITH SOME OBSERVATIONS 


Some measurements of pedestrian delays have been made at a pedestrian crossing without 
a refuge. The traffic remained fairly constant during the observations at an average level of 
1090 vehicles per hour, both directions combined. 

The mean delay to 325 pedestrians was 9-2sec. Using this and the vehicle flow above, we 
can calculate the critical gap J from the formulae for the mean delay, using either the first 
or the second assumption about the method used to cross. The first assumption, that 
pedestrians wait for a gap J right across the road, leads to J = 5-6 sec., which agrees well with 
the average observed time taken to cross of 7sec. On the second assumption, however, that 
pedestrians wait for a gap J in the near stream, followed immediately by a gap J in the far 
stream, we obtain J = 8-7sec., which is unreasonably large for crossing one stream of traffic 
only. Thus the first method of crossing seems to be in more general use in this case. The 
theoretical distribution of delays for the first method of crossing, with N = 1090 vehicles 
per hour and I = 5-6sec., has been calculated and is shown graphically in Fig. 4, together 
with the observed distribution. The fit for delays below about 2sec. is not good, but the 
reason for this may be that some people will delay for a second or so although they could 
quite safely cross without any delay, and according to our theory, would do so. A x? goodness- 
of-fit test on the data, with 6 degrees of freedom (corresponding to eight frequency groups), 
gives x? = 6-2, indicating that the fit is satisfactory for the subdivisions shown in the figure. 

50 





3 








—— Observed frequencies 
| ~~ -— Expected frequencies 


w 
o 














id 
So 





Frequency per 1 sec. interval 


= 
oO 








Greater than 
































ouaead 0 sec. 
1 a —_ 12 
a. crea oe ee 


Delay (sec.) 
Fig. 4. Comparison of observed and theoretical distributions of delay. 
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FURTHER APPLICATIONS OF RANGE TO THE ANALYSIS 
OF VARIANCE 


By H. A. DAVID 
University College, London 


1. INTRODUCTION 


The use of ‘range’ as a short-cut substitute for variance has recently been considered in the 
more elementary problems of analysis of variance. Patnaik (1950) dealt with a simple 
classification with equal cell frequencies and Hartley (1950) with a double classification. In 
this paper these methods are extended to cover a number of further orthogonal designs 
including: (a) a double classification with cell replication; (b) a double classification with 
two-factor ‘treatment combinations’; (c) the split-plot experiment. A procedure for the 
general three-way cross-classification is indicated. Approximate methods to deal with 
a simple classification with unequal cell frequencies are also considered. 

In each of the main cases an example is given as well as a summary of procedure, which it 
is hoped will be adequate for the reader interested only in the practical applications of the 
methods developed. 


2. DOUBLE CLASSIFICATION WITH CELL REPLICATION 


The essential difference between range methods and the orthodox analysis of variance 
consists, it will be recalled, in the replacement of mean-square estimators of variance by 
statistics involving ranges of observations in subgroups or of group means. We first describe 
the procedure in terms of an example. 


(2-1). Example 1 
Tables 1a and 16 form a convenient summary of an experiment on Portland cement 
(Davies, 1947, p. 91). Three ‘gaugers’ prepared twelve samples of Portland cement each, 
four of which were tested for compressive strength by each of three ‘breakers’. Table la 


gives the totals, and Table 16 the ranges in lb./sq.in. of four test results for the nine 
breaker x gauger combinations. 





























Table 1a. Totals of four replicates Table 16. Ranges of four replicates 
| \Besnleee Breaker 
1 2 3 Total 1 2 3 
Gauger Gauger 
1 136 - 4 — 74 58 1 104 186 116 
2 18 138 — 194 — 38 2 116 132 62 
3 270 166 — 33 403 3 80 96 114 
Total 424 300 —301 423 























Origin at 5000 Ib./sq.in. 
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Estimation of residual error. From Table 16 the mean range is given by 
® = (104+...+114)/9 = 111-8, 


and the estimate s,, of the error standard deviation is at once obtained as 111-8/2-08 = 53-8, 
the divisor, c = 2-08, as well as the equivalent degrees of freedom, v = 24-9, of the estimate 
being obtained from Table I as described in the Appendix.* (From the within-groups sum 
of squares Davies finds s = 51-4 with 27 degrees of freedom.) 

Estimation of interaction. Table 1a is clearly an ordinary double classification of the group 
totals. Following Hartley (1950), we set out in Table 1c the ‘breaker residuals’ on which 
the estimate of the interaction term} of such a design is based (for example, the first entry, 
— 5, is given to the nearest integer by 136 — 424/3). The figures in each column sum to zero 
except for rounding-off errors, which provides a useful check. If there is no real interaction 
we can, as usual, obtain an estimate s/, of the error s.p. from the interaction term. 
Remembering that Table 1c gives totals, this will be sj, = 354 x ,/4/(12 x 1-48) = 39-9; the 
corrective factor c’ = 1-48 and the equivalent degrees of freedom, v’ = 3-7, are read off this 
time from Table II. 


Table lc. Breaker residuals (totals) 











ey 
\ 1 2 : 3 Range 
Gauger $5 
1 —- 5 — 104 26 130 
2 — 123 38 —94 161 
3 129 66 67 63 
Total 354 


























Test for interaction and main factors. s/2/s?, with s,, = 53-8 and s/, = 39-9 may now be 
referred to tables of the F-ratio with degrees of freedom v’ = 3-7, v = 24-9. The interaction 
is obviously not significant. 

To test for the main gauger and breaker factors we form ‘Studentized’ ranges q,, 7p. 
Thus for gaugers, qa = {403 —(—38)}/(./(12) x 53-8) = 237, 

Ip = {424—(—301)}/(,/(12) x 53-8) = 3-89. 

On consulting tables of percentage points of q (Pearson & Hartley, 1943; Hartley, 1950) 
with v = 24-9 and sample size 3, only gp will be found significant (5 % point, 3-53, 1 % point, 
4-54). 

These three tests correspond to those made by Davies and give in each case the same result. 
It may be noted that the labour involved in using the range method is, in contrast to the 
standard procedure, little affected by an increase in the number of replications. 


and for breakers 


* It will be remembered that the corrective factor c is used to give an estimate, the expected value 
of whose square is o*. The commonly used corrective factor d, leads to an unbiased estimate of o. As 
the number of observations n increases, d,, > c. 

t In the ordinary double classification this term provides the ‘error’. 
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(2-2). Theoretical basis 
Just as in the orthodox analysis of a double classification with cell replication, it is necessary 
to distinguish between the ‘systematic’ and ‘random’ probability models. We discuss first 


the systematic case, the analysis of which has been illustrated by the preceding = 
Systematic model. This set-up may be written as 


Xin = K+ A+B +]; +2% (§=1...0;f =1...m; t= 1...2), (1) 


where capital letters denote systematic effects which are subject to the constraints 


l m l m 
i=1 j=1 i=1 j=1 

The z;, are mutually independent normal variates with zero means and variance o?. We 
then have the fundamental equation 


w, (i,j) =range (%;_) = range (z,,), for fixed i,j andt=1...n, (2) 


so that the statistic w,,(t,j) can be computed as the range of observations x, and its dis- 
tribution regarded as that of the range of the random variables z;,. Patnaik (1950) has 
approximated to the distribution of the mean w of a set of independent ranges by equating 
the expectation M and variance V of w/c to those of cy/,/v, where x has v degrees of freedom. 
Table I, giving the constants c, v, is an extension of Patnaik’s Table 1, and is to be entered here 
with sample size n, number of samples k = lm to give s,, = W/c, the estimator of o. 

From (1), %,;,= K+A,;+B;+J,;+2,;. Corresponding to (2), we reach the equation 


w;(j) =range (%;; —%;_) = range (I;;+2,; —2Z,_.), for fixed j andi = 1...1. (3) 
The m ranges w; are not independent. However, for the null-hypothesis 
I,;=0 (¢=1...0; 9 = 1...m) 


Hartley (1950) obtained the mean and variance of w’ and used these to fit a similar y-approxi- 
mation to the mean of the correlated ranges. His table of the resulting v’ and c’ is extended 
below in Table II. It is clear that on the null-hypothesis s/, = w’,/(n)/c’ is an estimator of 
o irrespective of whether A; and/or B; are present. On the alternative hypothesis (J;; not all 
zero) 8, will be seen from (3) to depend on the interaction terms J;; and the z;, only. It may 
be shown that s,, and s/, are independent (see §4), so that on the null-hypothesis s/2/s?, is 
distributed approximately as F with degrees of freedom v’, v. Finally, sincez, = K+A;+2;,_, 
the main A effects may be tested by the usual ‘Studentized’ range criterion; thus 


qa = \(mn) x range (Z;,.)/8, Ie = \(ln) x range (2, ; )/8,p. 
Random model.* For this case we may write 
Lig = K+ a; +b; + Uys + Zin (t= 1...1; 37 =1...m;t=1...n), (4) 


where the a,,6,, u;;,2;, are all mutually independent normal variates with zero means and 
respective variances o2, o?,0’%, 0%. From a slight modification of the above arguments it 
follows that, as in the ordinary analysis of variance, we proceed in the same way as for the 


systematic case, except that main factors must now be tested against s, so that 


da = (mn) range (Z;__)/8ip- 


* For the null-tests, the essential feature of the random model is the random interaction term u,;. 
The main effects may be systematic, or random as in (4). 
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(2-3). Summary of procedure 
Results may conveniently be set out in Table 2. 


Table 2. Summary table for double classification with cell replication 





Results for Example 1 





s Function Sample | Degrees of 
oe, calculated size freedom Numerical S, ; Degrees 
value of Er of 
function size | freedom 
































(i) Treatment A | ./(mn) range (%,_) 1 — 4/(12) x 441/12 3 — 

(ii) Treatment B | (In) range (z,; ) m — (12) x 725/12 3 — 

(iii) Interaction 81, = w’./(n)/c’ —— v’ (Table II) | (354/12) x ./4/1-48 ao 3-7 

(iv) Residual 8y = W/c — v (Table I) 111-8/2-08 — 24-9 
Tests 


For treatment A: 
Random model: refer (i)/(iii) to tables of g with sample size / and p.F = v’, 
Systematic model: refer (i)/(iv) to tables of g with sample size / and p.F. = v. 


For interaction refer, on either model, (iii)*/(iv)? to tables of F with degrees of freedom y, v’. 


(2-4). Simple hierarchical classification* 
Another set-up which may readily be handled by the methods so far developed is the 
simple hierarchical classification represented by the model 


Xin = K+ A; + Bi + 2,5 (§Q=1...,f=m1...m;t=1...2). (5) 


3. DOUBLE CLASSIFICATION, ‘TREATMENTS’ x ‘BLOCKS’, WITH TWO-FACTOR 
‘TREATMENT COMBINATIONS’ 


For definiteness we consider the theoretical model in the systematic form 
Xin = K+ A+ B+ 15 +C,+2 4, (6) 


where the suffices (7,7) denote the treatment combinations which may be broken up into two 
treatments and their interaction and t is the block index. 


Since from (6) Ey = K+ A,+B,+1;+2,;, (7) 


it follows that the residual error may be estimated in the same way as the interaction 


(blocks x treatment combinations) in an /m x n ordinary double classification. The J,; inter- 
action term is estimated as in §2. 


(3-1). Example 2 
The full procedure is set out in Table 3a (Paterson, 1939, p. 206). 
The block means are: Z_, = 217, %_, = 200,%_, = 201, , = 215. 
We now form the block residuals 2,;,—%_,(i = 1...3; 7 = 1... 4) as in Table 3b. Hence 
@ = 735/12 = 61-25. From Table II, c = 1-98, v = 24-4+5-4 = 29-8, so that s,, = 30-9. 
(Paterson obtains s = 32-2 with 33 p.¥.) The interaction may be computed most conveniently 


* For an example see Davies (1947, p. 81). 
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(asin § 2) from the totals 4(z,, —Z_; ), resulting in the ratio 1229 x ,/4/(30-9 x 12 x 1-76) = 3-77, 
whose square when referred to the F-tables with degrees of freedom 5-6, 29-8 is highly 
significant. We also find 


— | 4 = 2260/./6 x 30-9 = 18-3, gg=21-1 and gg = 1:85, 






































































































































} 
so that all factors except blocks are highly significant, a result identical with that reached 
by Paterson. 
rees 
f Table 3a. Yields (in lb. of dry matter per 1/57 acre) of three varieties of grass 
_ with four cutting rotations, repeated in four blocks 
a Grass ... 1 | 2 3 
- , Block | Rotation 
‘7 total total 
- Rotation... 1 | 2] 3]/4/]1/] 2/3 | 4/1] 2] 3] 4 
Block 1 96 | 187 | 222 | 109 | 146 | 252 | 246; 277/115; 298) 220] 430) 2598 | A= 1397 
2 70 | 163 | 125| 97 | 133 | 181 | 263 | 293] 143) 220; 341] 371) 2400 | B= 2582 
, 3 77 | 143 | 134 | 133 | 154 | 224| 194] 260] 117] .234| 258] 484] 2412 | C= 2663 
f 4 80 | 179 | 173 | 113 | 146 | 248 | 190; 325) 120) 253) 297) 460) 2584 | D= 3352 
" | Totals 323 | 672 | 654 | 452 | 579 | 905 | 893 | 1155 | 495 | 1005 | 1116 | 1745 | 9994 
> Ad | » 
2101 3532 4361 
the | 
j . 
Table 3b. Block residuals and their treatment ranges 
(5) 
Grass... 1 2 3 
Rotation ... 1 2 3 + 1 2 3 4 1 2 3 4 
(6) Block 1 —121 ) —30 5 | —108 | —71 35 29 60 | —102 | 81 3 | 213 
2 —130 | —37 | —75 | —103 | —67 | —19 63 93 | — 57 | 20| 141 | 171 
wo 3 —124 |} —58 | —67 | — 68 | —47 23 | -— 7 59 | — 84) 33 57 | 283 
4 —135 | —36 | —42 | —102 | —69 33 | —25 | 110} — 95 38 82 | 245 
(7) 
- Range 14 28 80 40 24 54 88 51 45 | 61 | 138 | 112 
r- 
(3-2). Summary of procedure 
Results may be tabulated as in the preceding paragraph (see Table 4). 
' 
Tests 
ce For treatment A: 
9. Interaction random: refer (i)/(iv) to tables of g with sample size | and D.F. = 3. 
ly : Interaction systematic: refer (i)/(v) to tables of g with sample size / and D.F. = 1}. 


For blocks C refer (iii)/(v) to tables of g with sample size n and D.F. = v; in either case. 
For interaction refer (iv)?/(v)* to tables of F with degrees of freedom 14, v}. 


_— 


26-2 
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Table 4. Summary table for double classifications with two-factor treatment combinations 














Results for Example 1 
Function Sample | Degrees of 
Source calculated size freedom Numerical Degrees 
value of Sample of 
function 81Z€ | freedom 
(i) Treatment A (mn) range (2,_.) l — 565 3 —_ 
(ii) Treatment B (In) range (Z ;.) m J 652 4 —_ 
(iii) Blocks C (lm) range (7. 4) n — 57-2 4 — 
(iv) Interaction A x B St, = Wa./n/cg — vg (Table IT) 116 — 5-6 
(v) Residual Sv, = W,/c; _— v; (Table IT) 30-9 _ 29-8 


























4. THE SPLIT-PLOT EXPERIMENT* 


The theoretical model appropriate to this design contains at least two random terms and may 
be written as 

tiy = K+ A+ B+ uj t+C.+ yt Zip, (8) 
where the symbols have their previous meaning so that ,;, 2; give rise to two independent 
error terms, here usually called error (1) and error (2), respectively. The only new feature to 


which this set-up gives rise is a slight modification of the way in which the residual (error (2)) 
is estimated. 


Estimation of 7. From (8) we have 
Hig — Zig, = Cyt Ty t Zip — 245. 
and consequently 

W,, (7, t) =range (x,4—%;;) = range (2; —2,;.), for fixed i,t and j = 1...m. 


Thus we obtain /n correlated ranges w,,(i, t). The estimate of o will again be #/c with v degrees 


of freedom, v and ¢ being in this case obtainable from Table III (see Example 3) whose 
derivation is explained in the Appendix. 


(4-1). Theoretical basis 


-In §2 we mentioned Patnaik’s (1950) approximation to the distribution of the mean of 
independent ranges by fitting a y-distribution adjusted to have the correct mean M and 
variance V. If the same approximation is made for the mean of correlated ranges the 
resulting v and c may be determined quite generally from his equations 


V 1 1 1 


= >* 32 Test” (9) 

oi 5 

and en BaP aly OB 
n ¢ M(1+Z+355 aa): (10) 


provided that for any given design we find the appropriate expressions for M and V. 
For the split-plot design the correlation between 2,4 —2,;, and zy —Z,;, is zero if i +7’ and 
is —1/(n—1) if i = 7’, so that the In ranges fall into / independent groups of n. Within each 


* For a description and application of this design see, for example, Snedecor (1946, p. 309). 
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group any two ranges are correlated with correlation coefficient p,, tabled by Hartley (1950, 


Table 4). Here p,, is a function of p = —1/(n—1) and the sample size m. Therefore 
M=6(w) = d,,o(1—n7"), (11) 
V =var (&) = var (> > w,,(#, t)/In) 
it 
= (l+n—1p,) Vn o*(1—n-")/(In), (12) 
eS 
and ym In d?, > (1 + (n— 1)p,,(n, m)\, (13) 


where d,, is the expectation and V,, the variance of the range of m independent unit normal 
variates. From (9), (10), (11) and (13) Table IIT can now be constructed. 

Before giving an example we shall prove that each of the test criteria employed has 
independent numerator and denominator, even when the null-hypothesis does not hold 
(i.e. some or all of the effects are real). The method of proof is quite general (compare Hartley, 
1950). From the ordinary analysis of variance, it follows that for the present set-up 


Latin = Leg — 2; 4-25, +2)? +B Ty +21-%,,-Z +2” 
+ 2U(Cp+2, 4-2)? +2 (YiG-Hi..-Y. 5+ HP 
+2(Bi+ 95-9? +2U(A;+H.-Y)? + NZ, 
where = denotes summation over all observations and y;; = u;;+%,;. Reducing the number 
of variates in each X-term to its degrees of freedom and introducing 


Vit = ijt — 243, at %.. (t => ) ete = 2...™M; t = 2... %), 
Zit => Tg 24-2... —2..4 +2 (a = 258: t = 2...2) etc., 


the first two terms become 
l l 
>> {Od vF,+ x (X Mp)? +X (DX M4)? + (ZX MH) 4 DD 2+ X(T Zu)? +d (x Zi)? + (x Zi)*, 
i=l] j ¢ a , 2 t,j i=2 t wv’ e-< i,t 


and it is clear that the joint distribution of the v,,, is independent of that of the Z,,, etc. But 
the ranges of z,;,—2Z,;, may be defined in terms of the r new variables v;,, so that they are 
distributed independently of the ranges of the J;,,+2; ,—2;,,, which can be defined in terms of 
the Z,,, etc. The general result follows. 


(4-2). Example 3 

Table 5 a is a section of Snedecor’s table (1946, p. 310) of the yields in tons/acre of three 
varieties, planted in six blocks and subject to four subtreatments. This table, giving the 
yields of one of the varieties is simply a 4x6 double classification. Subtracting the sub- 
treatment mean from each entry we obtain Table 5c, in which the ranges of the residuals in 
each block have been formed. In a similar way Table 5d is derived from the table of means 
(5 6) and enables us to estimate the variety-treatment interaction. 

Estimation of residual error. We have 1 = 3, m = 6, n = 4. Wis the mean of eighteen ranges 
of which the first six are listed in Table 5c. Using all varieties we find # = 0-310 and from 
Table III that c = 1-90, giving s,, = @/c = 0-163. v is obtained by multiplying v’ = 13-4 by 
three,* the number of varieties, so that v = 40-2..(Snedecor finds s = 0-161 with 45 degrees 
of freedom.) 


* For a justification of this approximate procedure, see the Appendix (Table ITT). 
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Table 5a. Yields of variety Table 5b. Variety-subtreatment means 
Variety 
Sub- Block Sub- (main treatment) 
treat- Mean treat- Mean 
ment ment 
1 2 3 a 5 6 1 2 3 
1 2-17 | 1-88 | 1-62 | 2-34 | 1-58 | 1-66 | 1-88 1 1-88 | 1-76 | 1-70 | 1-78 
2 1-58 | 1-26 | 1-22 | 1-59 | 1-25 | 0-94 | 1-31 2 1-31 | 1-30 | 1-41 | 1-34 
3 2-29 | 1-60 | 1-67 | 1-91 | 1-39 | 1-12 | 1-66 3 1-66 | 1-58 | 1-48 | 1-57 
4 2-23 | 2-01 | 1-82 | 2-10 | 1-66 | 1-10 | 1-82 4 1-82 | 1-64 | 1-61 | 1-69 
Mean 1-67 | 1-57 | 1-55 
Table 5c. Subtreatment residuals of Table 5a Table 5d. Table for the estimation of 
and their block ranges the variety-treatment interaction 
Block Variety 
Sub- Sub- 
treat- treat- Range 
ment | | 2 3 4 5 6 .| Ment 1 2 3 
1 0-29 0-00 | —9-26 | 0-46 |} —0-30 | —0-22 1 0-21 0-19 0-15 | 0-06 
2 0-27 | —0-05 | —0-09 | 0-28 | —0-06 | —0-37 2 — 0-36 | —0-27 | —0-14| 0-22 
3 0-63 | —0-06 0-01 | 0-25 | —0-27 | —0-54 3 —0-01 0-01 | —0-07 | 0-08 
4 0-41 0-19 0-00 | 0-28 | —0-16 | —0-72 + 0-15 0-07 0-06 | 0-09 
Range | 0:36 0-25 0-27 | 0-21 0-24 0-50 
































Estimation of variety-subtreatment interaction. From Table 5d we have 4%; = 0-45. Table IT 
gives cy = 1-54, vy = 5-4. Hence sj = Wy ,/(6)/cy = 0-179, so that Fg 49.9 = 1-24 (corresponding 
to Snedecor’s F, 4, = 1-25). The remaining procedure will be clear from the summary table 








(Table 6). 
(4:3). Summary of procedure 
Table 6. Summary table for the split-plot design 
s Function Sample Degrees of 
ries calculated size freedom 

(i) Main treatment A (mn) range (%;_) l — 
(ii) Blocks B (In) range (z,;.) m — 
(iii) Error (1) 8 = W’A/(n)/c’ — v’ (Table IT) 
(iv) Subtreatment C a/(lm) range (Z. .) n — 
(v) Interaction A x C 8} = Wya/(m)/cy — vy (Table II) 
(vi) Error (2) 8y = W/c — v (Table ITT) 
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Tests 


For treatment A refer (i)/(iii) to tables of g with sample size / and p.F. = v’. 

For treatment C refer (iv)/(vi) to tables of g with sample size n and D.F. = v. 

For error (1) refer (iii)?/(vi)? to tables of F with degrees of freedom v’, v. 

It will be noted that the same analysis is applicable to any three-way cross-classification 
with one of the three first-order interactions missing in the model, although there may be 
certain obvious modifications in the tests to be made. Often a priori evidence is available 


allowing us to assume that two of the factors do not interact so that the present model 
covers an important class of cases. 


5. THE THREE-WAY CROSS-CLASSIFICATION 
Here the systematic model is given by 


Estimation of «. From (14) 
Yin = Lin —% gg — Big +E 5 = Dy t 2-2-2547. 5.. (15) 
Again we form ranges for fixed i and ¢ giving In ranges w,,,(i, t). 
Now var (254 —2 4 — 243, +2,;.) = o(1—1) (n—1)/(In). 
Therefore M=64(wv) = J (ae dno, 


where d,, has been defined on p. 399 above. Arranging the ranges as in Table 7 (wy, = w,,(i, t)), 
we may distinguish three correlation coefficients: 

(i) Py, between ranges in the same row, 

(ii) ~,, between ranges in the same column, 

(iii) p,,, between ranges neither in the same row or column. 


Table 7. Pattern of ranges of residuals in a three-way cross-classification 





Wy Wye Win 








Wy Win 























It is easily shown that these are functions respectively of 
Pp, = —1f(n—-1),m;. py = —1/(l—1),m; ps = 1/{(n—1) (I—1)}, m.* (16) 


Thus V var (@) = 1 (21) Pan + = Nyt 1) C= 1) Pre) on (17) 


where V,, has been defined on p. 399 so that V may be obtained by entering Hartley’s (1950) 
table of p,, with the three pairs of values of (16).t Hence v and c can be determined. 

* p, is the correlation between the two ‘residuals’ y;;;, and Yist, Which are in the same (ith) row and 
correspond to the same j; similarly for p, and ps3. 


+ This, however, necessitates an extension of the existing table to smal! positive values of p, in view 
of ps. 
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It will be realized that the necessity of forming the ‘double residuals’ y,,, of (15) deprives 
the range method of much of its speed in the present case. At this stage it may therefore be 
useful to survey the applicability of this method to the commoner experimental designs. 
We have seen that a quick procedure is available in all cases when one or more of the inter- 
action parameters of (14) are absent. Moreover, if in an ”-way classification (n > 3) certain 
of the higher interactions can be neglected a priori, such a simple method will often still 
suffice. In a replicated 2” factorial experiment the residual can always be estimated, being 
the block x treatment interaction, the treatment terms being computed in the usual way so 
that all squaring of individual observations is still avoided. A similar method can be applied 
to certain cases of confounded factorial designs. The analysis of a Latin square set-up is, 
however, not possible without the formation of ‘double residuals’, and these, in addition, 


have a more complex pattern of correlations than the essentially ‘pairwise’ correlations 
which have so far arisen. 


6. SIMPLE CLASSIFICATION WITH UNEQUAL CELL FREQUENCIES 


We now consider the application of range methods to the analysis of variance of data 
classified in unequal groups. Both the accuracy of the approximations used and the labour 
saved (compared with the customary mean square procedure) will in general increase 
markedly with the total number of observations, which should exceed, say, twenty. In the 
following example (Kendall, 1946, p. 179) three ways of dealing with the present case are 
compared: (a) the standard technique based on the ratio of two sums of squares; (b) a method 
in which the between-group estimate of variance of (a) is divided by a new within-group 
estimate based on a ‘weighted mean range’; and (c) the ‘unweighted mean range’ method 
in which both the between- and within-group estimates are based on range. The first two are 
general methods both based on the F’-ratio test; the last is valid only in the case of nearly 
equal cell frequencies, but leads to a particularly short procedure which involves the q-ratio 
employed by Patnaik for the case of equal cell frequencies. 


Table 8. Lives in hours of four batches of electric lamps 














Batch Observations — | Mean Range 
1 1600, 1610, 1650, 1680, 1700, 1720, 1800 7 1680 200 
2 1580, 1640, 1640, 1700, 1750 5 1662 170 
3 1460, 1550, 1600, 1620, 1640, 1660, 1740, 1820 8 1636 360 
+ 1510, 1520, 1530, 1570, 1600, 1680 6 1568 170 
Total 26 900 
| 




















Method (a). Kendall obtains: between batches mean square = 14,787, residual mean 
square, s* = 6,880. Hence s = 82-9 with 22 degrees of freedom. 
Method (b). The within-batch estimate of error standard deviation is given by the expression 
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/V,,;). The quantities d,, /V,,, and d?,,/V, , are listed in Table IV for n; = 2—20.* 


1 


with v = $2(d2 
Thus 8,, = (200 x 3-904... +170 x 3-12)/(10-54 +... + 8-93 +0-5), 


yielding s,, = 87-4 with 19-4 degrees of freedom. The numerator of the F-ratio is calculated 
as in (a). ; 

Method (c). Here the estimate of residual error is simply @/c = 900/(4 x 2-66) = 84-6 
with v = 19-7, c and v being found by interpolation (at sight) in Table I, entered with 
n = Xn,/4 = 6-5 and k = 4. The three analyses may be summarized as shown in Table 9. 


Table 9. Comparison of three methods of analysing a simple classification with 
unequal cell frequencies 











Upper 5% 
Method Value of test ratio point 
(a) Standard F329 = 14,787/6,880 = 2-15 3-05 
(6) Weighted mean range F’3,19-4 = 14,787/(87-4)? = 1-94 3-12 
(c) Unweighted mean range Y4,19-7 = 112 x ./(6-5)/84-6 = 3-38 4-0 

















Thus all three methods show that the differences are insignificant at the 5 % level. More 
precisely, we find that the three test ratios fall respectively, at the upper 12, 16 and 11% 
points. The theoretical bases of methods (6) and (c) are given in the next two subsections. 


(6-1). The weighted mean-range method 
For the set-up under consideration, namely, 


yy = K+ A,+2; or y= K+uj,+%; (§=1...1; 9 = 1... ,), 


) 1 
a weighted mean of ranges ¥ «;w,,, («; constant) is evidently a suitable and computationally 
i=1 


convenient generalization of the mean range W so far employed as a measure of variability. 
To obtain the ‘best unbiased estimator’ s,, of this form, we solve for the «; by minimizing 
xa?V,, with respect to the «,, subject to the condition & (Za; w,,,)* = o?. The usual “undeter- 
mined multiplier’ technique leads to 
Ans) [op Be, (op ®s\*\* 
8, = La; Wp; = (27) [| yt (=>) { 


ni ni 


, Ong | (sng 1 
as Xd? /V,,> 1. Also the x-approximation gives to sufficient accuracy the degrees of freedom 
of s,, as v = 42d? /V,,.t 
This solves the problem of estimating o; if / = 2 it therefore enables us to make a t-test 
for the significance of the difference of two randomly chosen means of, say, m, and 7m, 


* As before d,, and V,,; are the expectation and variance of the range in samples of n,; independent 
unit normal variates. 

t+ Large cells (n,>12) may for greater efficiency be randomly subdivided into groups of approxi- 
mately eight (compare Grubbs & Weaver 1947). 
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observations, since (%,—%,)/{s,,(1/n,+1/n,)*} follows approximately a ¢-distribution with 
v degrees of freedom. For />2 no simple range statistic seems, in general, available for the 
estimation of the ‘between-groups’ term, which must be computed as a weighted sum of 
squares in the standard way. However, for nearly equal cell frequencies, a special procedure 
is possible and is developed in the following section. 


(6-2). Range test for a simple classification with approximately equal cell frequencies 


In the case of equal cell frequencies (n; = n; i = 1...1) Patnaik (1950) has shown that 
a test for between-group variability is obtained by gauging g = ,/n range (%; )/(w/c) against 
the 100x % point of the ‘Studentized’ range for ] and v, where both ¢ and v are given by 
Table I. It is therefore natural to consider the use of g when the n; are approximately equal, 
with 7, the mean cell frequency, replacing n. Some numerical investigation has indicated, 
as would be expected, that the inaccuracy introduced by this approximation is slight for 
the denominator of g. We now consider how the numerator is affected and what range of 
variation of the n; is permissible for a sufficiently accurate use of this test. 

On the null hypothesis (A; = 0 or u; = 0; 1 = 1...1) we have 


w,=range (%;) = range(z;) (¢=1...l). 
To simplify the notation, write Z; as z;. Then the joint probability of the inequalities 


2;,<2;.<2,+dz,; and 2z;,<2,<2,+w (h=1...1; h+1%) 


is f(2:) I ee: faten) des) dz 
h+i Zi 
where fi(z;) = (27)-* nto exp oom 9 
t 2 a2 u ; 


This is the probability that the mean of the ith group falls in an element dz;, that the /—1 
other group means are all > z;, and that the range of / meansis <w. As the probability density 
functions f; are different for each group, to find the probability integral of w we must not only 
integrate for z; but also sum for f;. It follows that the probability integral of w, is given by 


l co eth 
Pwo|my..m) = 3 [fled TH ([ faleadden) dee (18) 


This expression may be regarded formally as a continuous function of the n; and expressed, 
by means of a multivariate Taylor expansion, in terms of its value at n; = g(n) (i = 1...1), 
where g(n) is some central measure of the n;. Thus, if An; = n;—g(n), and we write P(n,) for 
P(w|n,, ...%), and P(g) for the same expression when all n; = g(n), we have 
l oP y's An; An, o2P 
P(n,) = P(g) + SAn, —) +3 9) 
i=1 


nN; ee. On, On; 





(19) 
waa oP) . (2Fin) sete 
on; On; ) nyagind, «,.n,<0ln) 


Now P(n,) is pei in the n;. Hence 0P(g)/én,; has the same value for all 7, so that 


l 
¥ An; a a_ fl eP(9) 5 > An;, an expression which will vanish if we choose g(n) = 7. Likewise 


i=l cn; On; i= = 


0?P(g)/(On,0n;) has only the two different values 0?P(g)/¢n? and 0?P(g)/On, On, (h +1). 

















Ne 
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(19) may now be written 


P(n,) = Pa) +> a( Ban 








,OP@) 2P(m) 
ni +z Zan Aman nm, On 
2 .. £ 
DAnt +i (”) 2; “_— 
mPG) soba 
= P(m)+= 3( ant Om, @) EAni+.. 


It is considered that the last term given, which may be denoted by —C(w| 7,1) ZAn?, 
accounts essentially for the difference between P(n;) = P(w | mj, ... n,) and our approximation 
P(n) = P(w|%, ...%).* From (18), we can obtain 


=1 | a zy) (0) an) dy, (20) 


where z(y) = (27)-+e-**, Introduce 


ytwvn 1-1 
duly, wm) = aty)( |" “a(a)an) 
and the operator V defined by . ; 
V(h(y)) = Ay) — Aly + w J) 
(where h(y) is any function of y). After some heavy algebra it can be shown that 


8770 (w | 7,1) = { oo + 2y? — y*) d(y) dy 


1 (0? F®) 5 








= Pa) +5 (Fe 





+[° (22-04 MVeeWl-C-YVEreW] dawhdy 
+(-2)[" (inet)? dred. (21) 


In Table 10a some typical values of 7*C(w|7,1l) are given, evaluated by approximate 
numerical quadrature for w = w,/,/n, where w, is the upper 100a % point of the range w 
in samples of | independent unit normal variates. 7C(w,/./7| 7,1) is from (21) a function 
of / and « only and may be termed C,,. The correction to P(w| 7%, ... 7%) is always negative, 
so that the significance level of w is, as a result of the approximation, somewhat lower than 
100a %. An example of the use of Table 10a is the construction of upper bounds to ZAn?/7* 
and to the coefficient of variation of the ,, supposing that we make the assumption that 
reductions of the significance level from 95 to 94% and 99 to 98-5% are permissible 
(Table 106). It will be seen that as / increases the coefficient of variation permissible to 
achieve the prescribed accuracy slightly decreases. 

As stated earlier, the use of 7 in place of the actual cell frequencies n; hardly affects the 
denominator @/c of the g-ratio employed in method (c). Table 106 will therefore also serve 
in the present case, which is the ‘Studentized’ form of the approximate test just examined, 
and is almost certainly on the safe side. For an illustration, we revert to the example given 
at the beginning of this section. Here 7 = 6-5, 1 = 4, LAn? = 5, so that LAn3?/n? = 0-12, 
a ‘permissible’ value. 


* This has been numerically verified to be a close approximation (for P(%) equal to 0-95 and 0-99) 
in the case 1/= 2 when exact values of P(n;) can be found from tables of the normal integral. It is, of 
course, realized that / = 2 is a rather special case. The third Taylor term of (19) may be shown to vanish 


l 
if & An‘ = 0, and is therefore always zero if | = 2. 
i=1 





Table 10a. Values of C,, at upper 100a % point 
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l 
a 2 4 6 8 10 
0-05 0-057 0-053 0-048 0-043 0-038 
0-01 0-019 0-019 0-018 0-017 0-015 























Note. P(w|n,, ... n;) = P(w|N, ... 7) —C14 x DAn?/n?*. 


Table 106. Maximum permissible values of XAn?/n? and (in brackets) of the 


coefficient of variation of the n,, i.e. of ./(ZAn?/l)/% 



































l 
ot 2 4 6 8 10 
0-05 0-18 0-19 0-21 0-23 0-26 
(0-30) (0-22) (0-19) (0-17) (0-16) 
0-01 0-27 0-27 0-28 0-30 0-33 
(0-37) (0-26) (0-22) (0-19) (0-18) 
APPENDIX 


Use and construction of Tables I-IV 


(1) Table I. This table giving c and v appropriate to the mean of k uncorrelated ranges in samples of n 
is an extension of Patnaik’s (1950) Table 1, which covered the range n = 3—10, k = 1—5. It has been 
obtained by rounding-off. A few corrections for small v were found necessary. For these, c was calculated 
from the exact equation of = 4+, /k 


y= A-14}- 5A - exA?’, 
where A = 2V/M?, which gives v to the required accuracy. 
The extension for k> 5 is based on equations (9) and (10) from which, for large values of pv, 
v=M?/(2V) (9’), andc=M (10’). 
Writing d,, and V,, for the expectation and variance of the range in samples of n independent unit normal 


variates, the variance of the mean of k ranges, each based on n observations, is V = V,/k. Thus (9’) is 
equivalent to y = kd2/(2V,) 
— n ni? 


so that d7,/(2V,,) may be regarded as a ‘constant’ difference (denoted by c.p. in Tables I and II). 


Example. Find c and v for n = 8, k = 12. We have c = 2-86, and from the last two entries in the 
v-column for n = 8 we obtain v = 60-6 + 2(6-03) = 72-7. 


(2) Table II is a similar extension of Hartley’s (1950) Table 5. 
(3) Table III. For the split-plot residual error we find from (13) that 


M?/V = Indy, /{Vnl1 + (nm — 1) puln, m)]}- (13’) 


The quantities vy and c, which are here functions of /, m and n, may be determined from (9) and (10) as 
usual. This would necessitate three-way tables which van, however, be avoided by noting that for the 
double classification of blocks and subtreatments, namely, 


and v (Florin, 1950) from 


Ly = K+ By+ 0,424 (gf=1...m;t=1...n), 
M?/V = nda /{V m{1 +(n—1) palm, m)]}, (22) 


we have 
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so that M?/V in (13’) is 1 times as large as in (22). It follows from (9’) that a rough value of v is given by 
l times the degrees of freedom of the entry in Table II with the same n and k = m, and from (10’) that the 
corresponding c is little changed. In Table III both v and c of Table IT have been slightly adjusted to 
make the determination of the split-plot error v and c as accurate as possible for values of | likely to occur 
in practice. 


Example. Take m = 5, n = 3. From Table III we read off v’ = 7-0, c = 1-53, so that we should take 
y= 7-01. Direct calculation gives: 
forl=2: v= 141, c= 1-54; 
t=3: v=21-1, c= 1-53; 
1=5: v= 35-0, c= 1-52. 


It may be noted that the corresponding unadjusted values of Table IT, namely, v = 7:2, c = 1-57, would 
have given much less accurate values for the scale factor and degrees of freedom of the split-plot residual. 


(4) Table IV. d,,d,/V, and d2/V, are listed for n = 2(1)20. Values of d,/V, and d2/V,, have been 
taken or computed from tables given elsewhere in this issue (Hartley & Pearson, p. 463). d, has been 
included to allow the simultaneous computation on a calculating machine of the sums £d3/V, and 
<xd,,w,/V,, by setting d, and w, at the same time and multiplying both by d,/V,,. 


I am greatly indebted to Dr H. O. Hartley for his kind advice and encouragement 
throughout this investigation and also to Prof. E. 8. Pearson for some suggestions as to 
the improvement of the text. 
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THE FITTING OF POLYNOMIALS TO EQUIDISTANT DATA 
WITH MISSING VALUES 


By H. 0. HARTLEY 


1. INTRODUCTION 


It is well known that the fitting of polynomials to observed data y,,x,(t = 1, 2,...,”) is 
considerably simplified if the x, are equidistantly spaced. Changing scale and origin of x to 
the standard values 

—4(n—-1),..., —1,0,1,...3(n—1) for n odd, 


X= —4(n—1),..., —4$,4,...3(n—-1) for n even, (1) 


the orthogonalization of the powers x} leads to the Chebycheff polynomials which have been 
tabulated by various authors (Anderson & Houseman, 1942; Aitken, 1933; Fisher & Yates, 
1948; Van der Reyden, 1943), and considerably simplify the procedure of fitting. Although 
with observed series the x, are often basically equidistant, situations frequently arise in 
which for certain x, in the series no observed y, are available, so that these missing y-values 
spoil the even sequence of the x-series. As examples we may mention here experimental plans 
providing for observations to be made at regular time intervals in which occasional omissions 
have occurred, or official statistics in which the interval of tabulation was changed (e.g. 
doubled or halved) over the time span under consideration. Such situations can, of course, 
always be dealt with by applying the general technique of polynomial graduation available 
for non-equidistant x, However, it is the purpose of this note to show that in these situations 
a great deal of computational labour can be saved through treating the data by what may be 
regarded as a ‘missing plot technique’ for orthogonal polynomials. It will become apparent 
that the tables of orthogonal polynomials provided in the literature are eminently suited to 
the present procedure. 


2. ORTHOGONAL FIT BY ESTIMATION OF MISSING VALUES 


Let x(t = 1, 2,...,) denote a sequence of equidistant abscissa values which, without loss 
of generality, are taken in the form (1). Let x, denote a subset of n’ of the x, at which 
observations y, are available, and x, the complementary set of n” abscissa values so that 
n=n'+n". To fit a polynomial y = P,(x) of degree k to the y, we take it in the form 


k 
P,(%) = Xa Fi(), (2) 


where the £'(z,) are the familiar orthogonal polynomials for sample size n (e.g. Fisher & Yates, 
1947). Under the usual assumptions of normally distributed independent observational errors 
of equal variance, the a; are determined as the least squares solution of 


Q'(a;) = p> (y, “— ~ a; Ei(x,))? = min. (3) 


The minimization problem (3) can now be modified as follows: simultaneously with the a; we 
estimate the ordinate values y, = uw, (say) corresponding to the z,. Minimizing therefore 


Q(4;, Ug) = X (Y, — 4 Fi(z,))? + X (Ug— 2 ,81(%9))? = min. (4) 
Zr i Lg 
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as a function of the k+1+m2” variables a;, w,, we obtain from 0Q/du, = 0 the conditions 
Up — ~ a,£;(%») = 0, so that the equations 0Q/da; = 0 are identical with the equations 


0Q’/0a, = 0 which the a; must satisfy in the original problem (3). Problem (4) is therefore 
mathematically equivalent to problem (3) and will be solved in two stages: 

(a) For any set of given w, minimize Q(a;, u,) and so obtain solutions a,(wp). 

(6) Substituting these solutions in Q(a;,,u,) obtain Q(a,(u,),u,) and minimize this as 
a function of the uw». The solutions wu, of this problem together with the a,(u,) will be the 
solutions of (4) and hence of (3). 

Now the solutions of (a) are clearly the orthogonal polynomial coefficients for the 
n= n'+n" ‘observations’ y,,u, and are given by 


,(Up) = (Ly, $4(X,) + D Ue Fi(%p))/d £;7(%). (5) 
ir ig x 


The sum of squares Q(a;(u,),%,) is therefore the error sum of squares of the orthogonal 
fit (a), so that the solutions of problem (6) are the ordinates provided by the polynomial 


at x = %p, i.e. by ly = 2 ,(up) E;(x9). ™ 


By substituting (5) in (6) or (6) in (5) we could obtain linear equations for the u, and a,(w,) 
respectively. The most expedient way of obtaining the solutions, however, is to solve the 
system (5) and (6) by iteration, as will be shown in the next section. 


3. AN EXAMPLE ILLUSTRATING THE COMPUTATIONAL PROCEDURE 


In order to illustrate the iteration process we use an example of n’ = 7 observations covering 
an x-range of n = 9 equidistant x, values as shown below: 


— ae ae ee ee le a a 
y, 12 14 . 2 22 . 36 41 50 


We show how to fit a cubic to these data, although we realize that in practice longer series 

would normally be required for such a fit. In columns 1-4 of Table 1 the orthogonal poly- 

nomials £5, £, £, 3; for sample size n = 9, which can be obtained from Fisher & Yates 

(1948), are shown together with the sums of squares > £;7(2z,) given at the foot of each column. 
Zt 


In column 5 the observed y, are entered. The iteration process based on equations (5) and 
(6) is now as follows: 

(i) Estimate trial values for the y in the missing places x, = — 2, 1 and call these U_, = 16 
and U, = 29 (column (6)). 

(ii) Using these to complete the series, compute trial values A; for the orthogonal 
coefficients from (5) (using wu, = U,) to obtain the A; shown in line (11). 

(iii) Using these A, in place of the a, in (6), compute new values for the u, for x, = —2, 1 
and enter the first corrections AU, = u,—U, in column (7). 

(iv) If these are not negligible compute corrections for the A; from 


AA, = DAU, E4(%5)/D E(x) 
Zo xX 


and enter these in line (12). 
Biometrika 38 27 
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(v) If these are not negligible compute second corrections for the U, from 


BU, = DAA,Ei(*%6) 


and enter these in column (8). 


(vi) If these are not negligible compute second corrections for the A; from 


6A;=> BU, 5 i(%o)/% £5?(a) 


and enter in line (13), ete. 


In the example the dU,, dA; need not have been computed. 


Table 1. Fitting of cubic to equidistant series with two missing values 















































Column 
Line 

(1) (2) (3) (4) (5) (6) (7) (8) 
& 1 2 Es Yr Ug | AUs 8U, 

(1) 1 —4 28 nia he ae pon 
(2) 1 —3 7 7 14 a a2 ase 
(3) 1 —2 —- 8 13 — 16 0-05 0-01 
(4) 1 on 17 9 20 orn -= ot 
(5) 1 0 —20 0 22 pei ey pe 
(6) 1 1 -17 -9 -- 29 |-—0-30 |—0-10 
(7) 1 2 —- 8 -13 36 — —_— — 
(8) 1 3 y -— 7 41 — -—— — 
(9) 1 4 28 14 50 — — — 
(10) & &;? 9 60 2772 990 — — — — 
(11) A; 26-67 4-700 0-156 0-002 — —_— — — 
(12) AA; — 0-03 — 0-007 0-002 0-003 — — — — 
(13) 6A, — 0-01 — 0-002 0-001 0-001 — — — 
(14) a, 26-63 4-691 | 0-159 | 0006 | — | — _ _ 








The final polynomial coefficients a; are obtained from a; = A;+AA;+6A;, and are shown 


in line (14). 


4. THE TEST OF SIGNIFICANCE 


Since the a; determined in §2 are the correct solutions of the least square problem (3), the 
k+1 degrees of freedom sum of squares corresponding to the fitted polynomial terms of 


degree 0 to k is given by b 
x a; x Y; £;(x,). 
i=0 Lr 


As a matter of computational convenience, therefore, one would record the sums 
Xy-5;(z,) separately when computing the A; in stage (ii) of § 3. In the example these sums 
Ir 


are as follows: 


Ly, = 195, Ly, g; = 285, Lyf, = 1053, 


Ly, £3 = 55, 
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The resulting analysis of variance table is set out below: 











Formulae Example 
Sum of squares D.F. Sum of DF. 
squares 

Total (including 
correction for mean) xy? n’ 6701 7 

Polynomial terms x a; Dy, &{(x,) k+1 6698 4 

a &r 
Error 2B ly, — BaEiley))* n’—k—-1 3 3 
@r 























As an approximate analysis of variance one could carry out the familiar computation of 
constant, linear, quadratic, cubic, ... terms on the completed series of n terms y,,U,, but 
reduce the degrees of freedom for ‘Total’ and ‘Error’ to n’ and n’—k—1 respectively. Both 
the exact and particularly the approximate analysis of variance can conveniently be carried 
out sequentially by fitting and testing in turn the various polynomial terms in the order of 
their hierarchy. 
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TRUNCATED LOGNORMAL DISTRIBUTIONS 
I. SOLUTION BY MOMENTS 


By H. R. THOMPSON 


Applied Mathematics Laboratory, Department of Scientific and Industrial Research, 
Wellington, New Zealand 


1. INTRODUCTION 


An important contribution to the study of skew-frequency curve problems was made when 
Spiller (1948) showed that many discrete distributions of J-shaped and related forms could 
be normalized by a logarithmic transformation of the enumeration variable. Using the 
device of plotting the cumulative distribution of the observations as probits against 
a logarithmic scale of counts, he found that the distributions became straight lines even 
though sometimes up to 70 % of the data was congregated at the first point. Both Spiller 
and the present writer have since found numerous distributions amenable to the same 
treatment, in all of which the variable is either a count or an occurrence. Examples of the 
former are counts of organisms (fungal spores 01. culture plates) or insects (aphides on leaves); 
of the latter, the frequency of accidents to bus drivers quoted by Yule (1944, table 3-2) or 
the number of words in sentences from different authors (Williams, 1940). Since in these 
discrete distributions we have a discontinuous variate which can take only integral values 
from 0 to infinity, it is obvious that the frequency of r counts per sampling unit must be 
represented by the equation 


(a — 2)? 


l log (r+1) o 
[ ~-(-=SF) dx, 2 Yr = 1, (1-1) 
\ ” r= 


oe FRR), 


logr 
where log (r + 1) is the normalizing function (all logarithms to base 10) and , o are parameters 
of the parent normal distribution. In a lognormal distribution with continuous variate, 
the curve (after transformation) is recognizably normal throughout. In the discrete case, 
however, the number appearing as zeros represents the integral over the range from —oo 
to 0, giving a markedly skew appearance to the curve. This, combined with the fact that 
negative values cannot explicitly occur, led to the descriptive name ‘truncated lognormal 
distribution’. 

The inadequacy of some attempted methods of representing this type of distribution by 
other theoretical models is well shown in Upholt & Craig (1940), whose data on black scale 
on citrus twigs is fitted very well by a truncated lognormal (see Spiller). The expected 
Poisson and Neyman Type C contagious distributions calculated by Upholt & Craig give 
absurdly high values of y*, whereas a truncated lognormal as in (1-1) with ~ = 0-439, 
ao = 0-609, gave x? = 11-3 for 7 degrees of freedom (P = 0-13), indicating a very reasonable 
agreement. A comparison of observed and expected values is given in the table on p. 415. 

Bliss (1937) observed instances of continuous truncated lognormal distributions in time- 
mortality data, truncation being either artificial (due to stopping the experiments before 
mortality was complete) or inherent (due to non-reaction of individuals). The data consist 
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—y)2 
(a >a, the point of truncation) and N — n with total probability ae [" exp ( om = ) dx, 


which lie somewhere in the truncated portion x <a. Bliss solved the distribution by maximum 
likelihood, computing corrections to a preliminary graphic estimate of the parameters by an 
iterative process. The calculations entailed in this method are fairly involved, and could 
only be reduced by the use of an extensive bivariate table giving the point of truncation in 
terms of the fraction in the truncated portion and some function of the first two moments. 
For such a table see Hald (1949), the distribution being termed by him ‘censored’. 





| 
No. of scales per 0 | = 2 | 3 4 5 | 6 7 8 | Over 
| 


| unit sample 
| 





36 30 14 159 





65 42 46 


| 
Observed frequency | 199 124 106 
of samples | 


144 94 


66 49 38 30 25 20 163 

















| 
| | 
| | 
| Expected lognormal| 193 | | 
| distribution | | | 
| | | 


| 





Gjeddebaek (1949) considered the maximuin likelihood solution of the discrete normal 
distribution described (using our notation) by the equation 


6 ] }- Ti+ ’ _ («—4)P 
«= 576m), exp( 207 ) ae, 


of which (1-1) may be regarded as a special case. The beauty of this method lies in the 
simplicity of the equations of maximum likelihood. If n; observations of the total sample 
of » fall in the ith class interval (x;,2;,,) their probability is g; from above, and from the 
likelihood function of the observations the equations for estimating 4, o respectively reduce 


+1 , _ ree 1 
- -=0, &n,; ge = 0, where ee a % = Tm) 
special tables giving (z;—2;41)/9;, (t:%:—ti41%41)/@; for values of t;=2, t,,=x+y, 
Gjeddebaek reduces these to the computational form &n;z,; = 0, &n;z,; = 0, in which case 
the variance matrix is also very simply expressed: 


o? (X4q;24; 29: %1:22i\* 
var (4,0) = n Shae 2g; 23; 








2 41 £2; —b 412% 








to Ln; e4, By means of 


However, the actual mechanics of the solution are even more laborious than Bliss’s method, 
the process again being iterative; when there are more than say four or five class intervals 
the method becomes almost impracticable. 

The essential requirement of a practical method is that it should be reasonably easy to 
apply, avoiding the laborious calculations inherent in iterative methods. The technique 
described in the present paper estimates by the method of moments the parameters 1, o of 
the discrete frequency distribution (1-1) by an approximation to the equivalent continuous 
truncated lognormal distribution. Corrections to these estimates have to be applied owing 
wo the unequal size of the logarithmic intervals from which the moments are calculated. An 
examination of the efficiency of the moment solution compared with that of maximum 
likelihood will be made in a later paper. In results so far obtained, the comparison is not 
unfavourable. 
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2. SoLUTION 


Consider a continuous random variable x, distributed normally with mean y, standard 
deviation o, but truncated at x = a so that the frequency function is of the form 


y=90 (<a) ) 


1 (x—p)? 
o (2m) exp(-“2") wae 


with a discrete mass point equal to 





(2-1) 
1 , (x—p)? oa 
= Jan) [" exp (- 5o2 ) dz at x=a. 


co 
Hence | ydx = 1, The assumption that the truncated frequency is accumulated at the 
i) 








/ 


point of truncation follows from the consideration that in the discrete frequency 
distribution which forms the basis of the paper where the truncation is at x = 0, negative 
values of the enumeration variable cannot occur. 

This distribution may be solved for uw, 7 by equating the sample moments 1}, 4, to those 
of the theoretical distribution. Taking moments of (2-1) about the point of truncation a, 
we obtain easily the following expressions: 








y(@) = (z—tg)o, . (2-2) 
Ha(@) = (q—t2 +g) 0°, 
where t= <—F, z= Tent q= [rea (2-3) 
_ (2—tg)? _ {u;(a)}? ' 
Write ¢,= ae 71:(a) ‘ (24) 
o, = 2-4. (2:5) 


Then ¢, is independent of o and is a function of only one unknown, t. Table 1 gives ¢, for t, 
proceeding by intervals of 0-1 from +2-0 to —4-0; from this table ¢ may be obtained by 
inverse interpolation if ¢, is given. The values of ~, « may then be determined with the help 
of ¢,, also tabulated against ¢ in Table 1. Thus for a sample of n observations of x, from the 
population (2-1), the parameters ~,a may be estimated as follows with the minimum of 
computation: 


(i) From the sample moments 


wi(a) = Z(e—a)|n, pia) = L(e—a)?/n, 
estimate ¢, from (2-4). 
(ii) Hence obtain ¢ from Table 1. 


(iii) Using 3, from Table 1, calculate 


o = pi(a)/d,, “= a—ot. 


Fort < — 4-0, the population may be regarded as non-truncated, in which case the parameters 
are estimated in the usual way from w = X(x)/n = %, o? = X(x—Z%)*/n. 
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Table 1. Functions ¢,, ¢, of the normal distribution 














































































































t ge A, A, ¢, A, A, t gs A, A, dg; A, A, 
2-0 | 0-0125 0-0085 —1-0 | 0-6098 1-0833 
1-9 | 0-0158 ’ 7 | 0-0111 = 7 |.-1-1 | 0-6353 a —12 | 1-1686 a 22 
1-8 | 0-0199 | 4) 8 | 0-0143 | 37 | 8 | —1-2 | 0-6595 | 588 | —13 | 1-2561 | B42 | 19 
1-7 | 0-0248 | £9| 10| o-o183| $9 | 9 | —1-3 | 0-826 | 59° | —13 | 13455 | BOT | 17 
1-6 | 0-0307 | 98 | 11 | o-0232 | 9? | 11 | —1-4 | 0-7043 | 317 | —13 | 1-4367 | 956 | 15 
1-5 | 0-0376 12 | 0-0293 13 | —1-5 | 0-7247 —13 | 1-5293 13 
1-4 | 0-0457 4 13 | 0-0367 - 15 | —1-6 | 0-7438 a4 ~13 | 1-6232 je il 
1-3 | 0-0551 | 53 | 14 | 0-0455 | 6? | 17 | -1-7 | 0-7e16 | 478 | —13 | 1-7183 | 965 | 9 
1-2 | 0-0659 | 198 | 15 | 0-0561 | 198 | 19 | —1-8 | 0-7782 | 186 | —12 | 1-8143 | Sep] 8 
1-1 | 0-0782 | 535 | 16 | 0-686 | 122 | 22 | -1-9 | o-7935 | 193 | -12 | 10111 | Qos | 7 
1-0 | 0-0921 | .__ | 16 | 0-0833 25 | —2-0 | 0-8077 —11 | 2-0085 5 
0-9 | 0-107 | 125 | 17 | O-1004 | 171 | 27 | —2-1 | o-s208 | 131 | -10 | 21065 | 980 | 4 
0-8 | 0-1249 | 175 | 17 | 0-1202 | $98 | 29 | —2-2 | o-g329 | IT} | —10 | 22049 | O88 | 4 
0-7 | 0-1438 | 503 | 17 | 0-1429 | 527 | 31 | —2-3 | o-8aao | 145 | — 9 | 23037] Gor | 3 
e e e dite le 9 —_ ° 
0-6 | 0-1644 | jo | 16 | 0-1687 | 58 | 33 | —2-4 | o-g542 | 10% 8 | 2-4027| O03 | 2 
0-5 | 0-1866 | . 15 | 0-1978 35 | —2:5 | 0-3636 — 8 | 2-5020 2 
0-4 | 0-2104 | = 14 | 0-2304 - 37 | —2-6 | 0-8722 > — 7| 2-6015 re 1 
0-3 | 0-2356 | 50° | 13 | 0-2668 | 303 | 38 | —2-7 | o-sgo1 | 73) - 6) 2-7011| 957 | 1 
0-2 | 0-2621 a76 11 | 0-3069 | 44, | 39 | —2:8 | 0-8874 | 47 | — 6 | 28008 | 996 1 
0-1 | 0-2897 | 376 9 | 0-3509 | $40 | 40 | -2-9 | o-soai-| 85 | — 5 | 29005 | O99 | 1 
0-0 | 0-3183 | 7 | 0-3989 | _.. | 40] —3-0 | 0-9002 — 5 | 3-0004 
—0-1 | 0-3476 | 593| 5 | 0-509 | 520 | 40 | —3-1 | o-9059| 93 | — 4| 31003 | 3o9| | 
—0-2 | 0-3775 | 309 3 | 0-5069 | 560 | 39 | -3-2 | o-o111 | 93 | — 4 | 32002 | G99 | 
—0-3 | 0-407 | 303 1 | 0-5668 | 999 | 38 | —3-3 | o-9160 | 4% | — 4 | 3-3001 | 1969 
—0-4 | 0-4379 | 307 | — 1 | o-6304 | 637 | 37 | —3-4 | o-9204 | 47 | — 3 | 3-4001 
0-5 | 0-4680 | 49, | — 3 | 0-6978 | ... | 35 | —3-5 | 0-9246| 4. | — 3 | 35001 
—0-6 | 0-4978 | 595 | — 5 | 0-7687 | 799 | 33 | —3-6 | o-0284 | 35 | — 3 | 3-6000 
—0-7 | 05270 | 52? | — 7 | 0-8429 | 74° | 31] —3-7 | o-9319 | 39 | — 3) 3-7000 
—0:8 | 0-5555 376 — 9 | 0-9202 | 055 | 29 | —3-8 | 0-9352 | 3) | — 2 | 3-8000 
== (}- 5 wu ° 9 one le ara 2. 
0-9 | 0-5831 | 355 | —10 | 10004 | $9 | 27 | —3-9 | 0-383! 96 2 | 3-9000 
~1-0 | 0-6098 —11 | 10833 | | 25 | —4-0 | 0-9412 | — 2| 4-0000 
Table 3. Values of x = $[logr + log (r + 1)] and x? 
For r> 100, the error involved in making the approximation x = log (r+ 4) is negligible. 
r x 2? r a Tr x 2? r x 2? r x a? 
1 | 0-151 | 0-023 | 21 | 1-332 | 1-775 | 41 | 1-618 | 2-618 | 61 | 1-789 | 3-200 | 81 | 1-911 | 3-652 
2 | 0-389 | 0-151 | 22 | 1-352 | 1-828 | 42 | 1-628 | 2-652 | 62 | 1-796 | 3-225 | 82 | 1-916 | 3-673 
3 | 0-540 | 0-291 | 23 | 1-371 | 1-880 | 43 | 1-638 | 2-685 | 63 | 1-803 | 3-250] 83 | 1-922 | 3-693 
4 | 0-651 | 0-423 | 24 | 1-389 | 1-930 | 44 | 1-648 | 2-717 | 64 | 1-810 | 3-274 | 84 | 1-927 | 3-713 
5 | 0-739 | 0-545 | 25 | 1-406 | 1-978 | 45 | 1-658 | 2-749 | 65 | 1-816 | 3-299 | 85 | 1-932 | 3-732 
6 | 0-812 | 0-659 | 26 | 1-423 | 2-025 | 46 | 1-667 | 2-780 | 66 | 1-823 | 3-323 | 86 | 1-937 | 3-752 
7 | 0-874 | 0-764 | 27 | 1-439 | 2-071 | 47 | 1-677 | 2-811 | 67 | 1-829 | 3-346 | 87 | 1-942 | 3-771 
8 | 0-929 | 0-862 | 28 | 1-455 | 2-116 | 48 | 1-686 | 2-842 | 68 | 1-836 | 3-370 | 88 | 1-947 | 3-791 
9 | 0-977 | 0-955 | 29 | 1-470 | 2-160 | 49 | 1-695 | 2-872 | 69 | 1-842 | 3-393 | 89 | 1-952 | 3-810 
10 | 1-021 | 1-042 | 30 | 1-484 | 2-203 | 50 | 1-703 | 2-901 | 70 | 1-848 | 3-416 | 90 | 1-957 | 3-828 
| | 
11 | 1-060 | 1-124 | 31 | 1-498 | 2-245 | 51 | 1-712 | 2-930 | 71 | 1-854 | 3-438 | 91 | 1-961 | 3-847 
12 | 1-097 | 1-202 | 32 | 1-512 | 2-286 | 2 | 1-720 | 2-959 | 72 | 1-860 | 3-461 | 92 | 1-966 | 3-866 
13 | 1-130 | 1-277 | 33 | 1-525 | 2-326 | 53 | 1-728 | 2-987 | 73 | 1-866 | 3-483 | 93 | 1-971 | 3-884 
14 | 1-161 | 1-348 | 34 | 1-538 | 2-365 | 54 | 1-736 | 3-015 | 74 | 1-872 | 3-505 | 94 | 1-975 | 3-902 
15 | 1-190 | 1-416 | 35 | 1-550 | 2-403 | 55 1-744 | 3-042 | 75 | 1-878 | 3-527 | 95 | 1-980 | 3-920 
16 | 1-217 | 1-482 | 36 | 1-562 | 2-441 | 56 | 1-752 | 3-070 | 76 | 1-884 | 3-548 | 96 | 1-985 | 3-938 
17 | 1-243 | 1-545 | 37 | 1-574 | 2-477 | 57 | 1-760 | 3-096 | 77 | 1-889 | 3-569 | 97 | 1-989 | 3-956 
18 | 1-267 | 1-605 | 38 | 1-585 | 2-514 | 58 | 1-767 | 3-123 | 78 | 1-895 | 3-590 | 98 | 1-993 | 3-974 
19 | 1-290 | 1-664 | 39 | 1-597 | 2-549 | 59 | 1-775 | 3-149 | 79 | 1-900 | 3-611 | 99 | 1-998 | 3-991 
20 | 1-312 | 1-720 | 40 | 1-607 | 2-584 | 60 | 1-782 | 3-175 | 80 | 1-906 | 3-632 |100 | 2-002 | 4-009 | 
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The extension to the solution of the discrete distribution (1-1) is made by calculating the 
parameters as if the distribution were in fact continuous, and applying corrections to the 
estimates for the discontinuities caused by grouping. In the case of a continuous variable 
grouped into intervals of equal size and with contact of high order with the x-axis at both 
terminals, the corrections to the raw or grouped moments are functions only of the size of the 
grouping interval. For a distribution such as (1-1), however, regarded as being derived from 
the continuous case, there is a lack of high contact at the lower terminal 0 and, in addition, 
the grouping intervals, being logarithmic, are of unequal size. The unequal class intervals 
would in themselves lead to very complicated correction formulae, but the lack of high 
contact means that the Euler-Maclaurin formula on which Sheppard’s corrections depend 
is invalidated. This follows since for the normal function e~** the necessary condition that 
derivatives of all orders vanish at the terminal does not hold; they do not even converge 
when the terminal frequency is finite as in a truncated distribution. Any modification of the 
Sheppard corrections such as Hartley’s (1950) based on the Gregory integration formula 
is also inapplicable generally for a grouped truncated normal. 

If then for a theoretical distribution with given ~, o we calculate moments using the 
mid-points of successive logarithmic intervals and then estimate yw, o as if the distribution 
were continuous, we shall obtain biased estimates jy’, a’ say, in error by amounts depending on 
the values of u, a. Since the magnitudes of these errors cannot be determined as explicit 
functions of « and o or the first two moments, they have been obtained from a series of known 
discrete lognormal distributions with n = 10,000 (Fisher & Yates, Statistical Tables, table IX), 
varying and a so as to cover the range of y’, a’ given in Table 2. The table has been extended 
to include low values of o’ and high negative values of y’ (the latter corresponding to 80 % or 
more zeros), although they will seldom if ever be needed, and the corrections are tabulated 
for interest only. Over the rest of the table, where the numerous experimental distributions 
studied by the author have fallen, second difference interpolation will give an accuracy 
commensurate with that of the table itself. For this purpose, second differences for each 
entry have been tabulated where necessary (i.e. omitted if they have no influence) as shown 


below: f(u',o’) Az, f(u',o' +01), 
iB, Be (2-6) 
f(“’'+0-1,0’) f(“' +0-1,0'+0-1), ete., 
where Mew =f(H',0') +f(w’ + 0-2, 0’) — 2f(u' + 0-1, 0’), (2-7) 


and similarly for A2.,,, while 
Meg =f(u',0')+f(u' +01, 0 +0-1)—f(w' +0-1,0’)—f(w',o’ +01). (2-8) 
A suitable formula for interpolation (where 0 < m,n < 1) is then 
f(u' + 0-Im, o’ + 0-In) = f(u’, 0’) + mA, + nd, + mndjng — ¥m(1 —m) Aj, — 4n(1 —n) A2,,, 
(2-9) 
from which terms may be omitted if they do not affect the final result. Often sufficient 
accuracy will be obtained with simple linear interpolation. 


3. EXAMPLES 


The method of solution of the discrete lognormal distribution is illustrated by two examples 
from biological sources. The first is a record of counts of penicillia spores observed on Petri 





et 


























———— 











H. R. THompson 421 








































































































Example 1. Spores of penicillia on Example 2. Aphides on forty leaves 
506 Petri dishes. of variety Osborne Park. 
No. of No. of 
spores per | Observed x 2? aphides | Observed c a 
dish (r) frequency per leaf (r) frequency 
0 277 _ — 0 9 mee pam 
1 117 0-151 0-023 1 11 0-151 0-023 
2 42 0-389 0-151 2 4 0-389 0-151 
3 22 0-540 0-291 3 5 0-540 0-291 
4 15 0-651 0-423 5 2 0-739 0-545 
5 12 0-739 0-545 7 1 0-874 0-764 
6 8 0-812 0-659 8 2 0-929 0-862 
7 2 0-874 0-764 10 2 1-021 1-042 
8 3 0-929 0-862 11 1 1-060 1-124 
9 2 0-977 0-955 27 1 1-439 2-071 
10 2 1-021 1-042 | 43 1 1-638 2-685 
11 1 1-060 1-124 222 1 2-347 5-510 
18 1 1-267 1-605 
22 1 1-352 5 a | Si 
28 1 1-455 2116 | | Total 40 
Total 506 
Statistic Example 1 Example 2 
mw 0-16735 0-4663 
Mh 0-09560 0-4841 
bs 0-2930 0-4492 
t 0-0885 — 0-4375 
1 0-3562 0-6553 
o’ 0-4698 0-7116 
yw’ —0-0416 0:3113 
Corrections to i pid — 
yh’ —0-0199 —0-0017 
0-4869 0-7149 
hh —0-0615 0:3096 
Example 1 Example 2 
Class interval 0 1 2 3 4 5 |6-7|/8and] 0 | 1; 2 | 3-4 | 5-9 |Over 
over 9 
Observed frequency | 277 | 117 | 42 | 22 |15 |12 |10 | 11 9 {11 |4 |5 {5 |6 
Expected frequency | 278-4 | 112-1 | 47-5 | 24-2 | 13-8 7) a 12-0 | 13-3 | 6-5] 3-9 | 4-6 | 5-0 | 6-7 
xj) = 2°62, P = 0-76 xis) = 4°61, P = 0-20 


dishes exposed to the atmosphere for 5min. at six localities in Auckland over a period of 
12 months. The second represents counts of the common carrot aphid (Cavariella aegopodit), 
part of a varietal infestation trial on carrots. Samples of forty leaves were taken, the results 
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for one variety being reproduced here. The successive steps in computing the mean and 
standard deviation of the parent distributions are set out in a convenient form in the tables 
on p. 421. For the calculation of moments, the distributions have been tabulated with the 
counts in ascending order and the variate values x = }[logr+log(r+1)] (i.e. the mid- 
points of the frequency intervals), obtained from Table 3 are placed alongside. The rest 
of the calculations follow exactly the order explained in §2. A comparison of observed 
with expected frequencies is made, and the values of y? obtained indicate no significant 
departures from hypothesis in the two samples. 


The author wishes to thank Dr T. R. Vernon and Mr K. P. Lamb, of Plant Diseases Division, 
Auckland, for permission to use their data on spore counts and aphid counts respectively. 
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A CHART FOR THE INCOMPLETE BETA-FUNCTION AND THE 
CUMULATIVE BINOMIAL DISTRIBUTION 


By H. 0. HARTLEY anp E. R. FITCH 


1. PURPOSE OF THE CHART 


The present chart gives values of the Incomplete Beta-Function 
z 1 
P = 1,0) = [esa —gy-tag/{ "gs —ep-rag (1 
0 0 


to graphical accuracy. It thereby provides, in compact form, approximate answers to the 
numerous problems in which this function arises. Of these we only mention the following 
two examples. 


(1-1) The cumulative binomial distribution 


If in repeated, independent trials the expected proportion of successful trials is p, find the 
chance P of observing at least a successful trials in a total of n trials. This chance is given by 


P= = (") pi(l—p)"* = L,(a,n—a+1). (2) 


(1-2) The distribution of the variance ratio F 


If s? and s? are two independent sample variances based on vy, and v, degrees of freedom, 
find the chance that the variance ratio F = s?/s3 exceeds a given level F,. This chance is 
given by 

P(F > Fy) = 1,,(4¥2, 4%), where 2% = V/(v2+V,F)). (3) 


In certain applications we require the ‘percentage points’ of z, i.e. the roots x of I,(a, b) = P, 
for given values of P, a and b. These can be obtained by sight from the chart as also can 
be the confidence points and percentage points of the equivalent distributions (1-1) and (1-2). 

In using the chart, the usual restriction a >b is made, but values for a <b can be obtained 
from 
I(a, 6) = 1--L,_,(6, a). (4) 


2. SCALES OF THE CHART 


On the top edge of the chart, a scale of b (= 4v,) is provided which is split into three sections: 
A, 1<b<4; B,4<b< 15; C, 15<b< 60. These sections form the upper margins of the three 
corresponding chart sections A, B and C. Each of these three sections is further divided into 
upper and lower parts. The lower part contains a family of curves corresponding to the 
values of the parameter a shown, while the toy) part contains a family of curves corresponding 
to selected values of the probability P. Both families are referred to the same abscissa scale 
b at the top edge. The two families of curves are linked by the aid of x-scales provided on 
the ruler enclosed in the pocket at the end of the volume. Each z-scale is marked with an 
arrow (—) at the pivot point, whose use is explained below. 
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3. USE OF THE CHART 
(3-1) To find P = I,(a,b) for given a, b and x 

Choosing the appropriate scale (A, B or C) place the ruler vertically on the chart, so that its 
x-scale intersects the abscissa scale (at top of chart) at the given value of b. Slide the ruler 
up or down this vertical until its pivot mark (—) is at the intersection of the vertical with the 
curve corresponding to the given value of a in the bottom part. With the z-scale in this 
position, the required value of P = I,(a, b) can be read off in the top part opposite the given x. 

Examples. (1) Givena = 30, b = 5, it is required to find [,(a, 6) for x = 0-70, 0-94 and 0-96. 
Using section B we find: 








For comparison, 
* From chart | from tables of I,(a, b)* 3 
0-70 0-012 0-012 
0-94 0-95(2) 0-949 
0-96 0-99(0) 0-989 

















* Tables of the Incomplete Beta-Function, edited by K. Pearson (1934). 


(2) In binomial theory (see (1-1) above), given that p = 0-2 and n = 25, it is required to 
find the chance of obtaining, at least, a = 10 successes. This chance is given by J,.,(10, 16) 
or 1—4,,(16, 10), so that, using section B, we find it to be equal to 1 — 0-985 = 0-015. 


(3-2) To find the percentage points of the Incomplete Beta-Function 


(1) Given a = 4and 6 = 2, it is required to find the upper and lower 10 % points. Using 
section A we find: 





P For comparison, 
° from tables I,(a, b) 





0-90 0-89 0-888 
0-10 0-42 0-416 

















(3-3) T'o find the probability integral of F and to evaluate the 
percentage points of the distribution 
Using the relation (3), we proceed as in (3-1), putting a = 4v,, b = 4v, and vas given by (3). 
Example. Given v, = 40, v, = 120, find the chance that F > 1-5. We have 
a=60, b=20, x = 120/180 = 0-667, 


and hence, using section C, we find P{F > 1-5} = 0-05(4). Merrington & Thompson’s (1943) 
tables of the percentage points give F = 1-495 for the upper 5 % point of F. 


4. THEORETICAL BASIS OF THE CHART 


The Incomplete Beta-Function involves the four parameters a, 6, x and P. 
In K. Pearson’s (1934) tables P is tabulated as a function of x and a (called p) in a series 
of double-entry tables progressing by values of the fourth parameter 6 (called q). In con- 
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sidering any other form of tabulation or graphical representation of this four-parameter 
relationship it would, at first sight, appear that a series of tables or charts would be required. 
The reduction of the data to a single sheet is here achieved with the aid of the approximate 
formula 
— log ( — log z) = f(P,b)+g(a, b), (5) 
where f(P, 6) = log {v,/x?(P, 2b)}, 
g(a, b) = log {}(2a — 1)/b + 3} —log {1 + $(6— 1) (26+ 1)/(2a+6—1)%}, 


and x°(P, 2b) is the 100P % point of x? for 2b degrees of freedom. Thus log (—logz) is 
represented as a sum of two functions each involving only two parameters, and the chart 
is an obvious representation of this relationship. 

Formula (5) was derived from an approximation recently published by Wise (1950). 
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THE EXPECTED FREQUENCIES IN A SAMPLE OF AN ANIMAL 
POPULATION IN WHICH THE ABUNDANCES OF SPECIES 
ARE LOG-NORMALLY DISTRIBUTED. PART I 


By P. M. GRUNDY 
Rothamsted Experimental Station 


INTRODUCTION 


The sampling of animal populations by traps or other methods of random collection leads to 

interesting statistical problems, one of which is the subject of this paper. It is assumed that 

the counts from different species are independent Poisson variates, as occurs when all 
individuals have smalland independent chancesof capture. Accordingly, thereis a probability 

e-™ m? |r! 

of catching r individuals of any given species, m being the expected number. Here m may 
be called the abundance of the species concerned, but it must be recognized that this means 
abundance relative to both the intensity and the method of sampling. As in the paper of 
Fisher, Corbet & Williams (1943), the structure of the population may be investigated by 
studying the numbers of species represented in the sample by various numbers of individuals. 
The expected number of species with r individuals in the sample is 


n, = S(e-™ m’/r!), (1) 
where S denotes summation over all species in the population. 
In practice the number of species is large, and for theoretical purposes the frequency 


distribution of their abundances is taken to be continuous. Three such distributions which 
have been suggested are: the Eulerian or Type III distribution 
ae—hm ms! dm, 

the limit of this as e—> 0, and the log-normal distribution. The first of these alternatives leads 
to a negative binomial series of n,, and the second to the logarithmic series investigated by 
Fisher et al. (1943); on the properties of these and related distributions, cf. Anscombe (1950). 
Observational evidence in favour of the normal distribution of logm has been adduced 
by Preston (1948); in the same paper he has worked out some examples of expected sample 
frequencies, using a discrete approximation to the normal distribution. His results indicate 
that the expected sample frequencies, grouped along the scale of r by octaves, conform well 
to the population curve after the first few octaves. 

The object of this paper is to give tables and formulae for calculation of the n, when m has 
a given log-normal distribution. Information of the kind thus provided, especially regarding 
the frequencies of rare species, is becoming increasingly necessary for the comparison of 
theory with the observed results as these accumulate. It is convenient to work with the 
proportions ¢, = n,|N, (2) 


where N denotes the population number of species, 


N = Np + Ny + Ng + eevee 
Biometrika 38 28 
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In Part I, Tables 1 and 2 give the expected values of the proportion of species occurring in 
the sample, 1 —¢,, and of the proportion of singletons, ¢,. The other ¢, can be calculated, 
over the range of a and o? anticipated, by an approximate method which will be described 
in Part IT; it is only for ¢, and ¢, that the labour of constructing explicit tables appears to be 
justified. An approximate formula relevant in the fitting of the distribution to observational 
data is also in sight, but this cannot be discussed at the present stage. 

The notation used for the parameters of the distribution of abundance is as follows. The 
variance of the natural logarithm In m is denoted by o?; the distribution of In m is assumed 
to be normal with mean Ina, so that a denotes the median abundance. Thus for all r>0 


ee I Pea fm) dm, (3) 
0 


r! 


where f(m) = waa om (ie (in =)" ‘ (4) 


The expected number of individuals in the sample (the integral of Nmf(m) dm, which reduces 
to a formula involving the normal integral) is Na et’. 

The Jahnke & Emde tables (1945) include a table, due to Wagner (1913), of the ‘after- 
effect function’ (a ,/(27)/a) ¢, as a function of a and 1/o ./2. However, both the range and the 
irregular spacing of the values of a make that table inconvenient for application to the 
present problem. A fresh table of ¢, has therefore been calculated (Table 2), as well as the 
table of 1— ¢, (Table 1), proceeding at equal intervals of log a = log,,a. The methods used in 
the calculation and checking of these tables are discussed below. 





INTERPOLATION 


The probits (i.e. normal equivalent deviates + 5) of the entries in Table 1 have fairly small 
and slowly varying second differences with respect to loga, and it is advantageous to use 
the probits in interpolation with respect to loga. A similar advantage is obtained by taking 
the probits of the differences between corresponding values in Tables 1 and 2 (i.e. by using 
the probit of 1 — ¢)— ¢,) in interpolation with respect to log a. The transformation to probits 
is suggested by Preston’s approach (1948) (cf. Fig. 1 of the present paper). However, the 
transformation is not advisable in interpolation with respect to o. 


Prosits oF TABLE 1 


Preston (1948) has introduced the concept of the ‘veil line’ separating the less abundant 
species (m < 1) from those more abundant (m > 1). Heuristically, the sample may be supposed 
to include most of the species to the right of the veil line, and few of those to the left. This line 
of thought suggests examination of the probits of the entries in Table 1, and leads to the 
expression* Rais 


5+ a. (5) 
as a possible approximation to the probit of 1—¢,. In this context it was suggested to the 


author by Dr C. B. Williams that the level of abundance m = 4 might give a better position 
for the veil line. This corresponds to replacing Ina by In 2a in the above formula, but it has 


* Preston does not, however, seem to use this, since he adopts a method of grouping into octaves by 
which half the species appearing as singletons in the sample are assigned to the left of the veil line. 
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been found that some adjustment of the variance is also desirable. Over the range of Table 1 
the empirical formula po hi Ss a 
(a? + 1-4) 

gives about as good an approximation to the probit of 1 — ¢, as is possible for a function linear 
in Ina. There are also theoretical reasons for using an approximation of this type, which the 
author hopes to discuss in a future paper. A comparison of the correct values (based on 
Table 1) with the expressions (5) and (6) is shown in Fig. 1, and it will be seen that for the 
rather small variance 0? = 2 the graph of the probits shows appreciable curvature; when 
o* = 5 the curvature, though perceptible, is very slight. 
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Fig. 1. Proportion of species expected in the sample. Comparison of correct values (Table 1) 
with approximate formulae, on probit scale. 


METHOD OF CALCULATING TABLES 1 AND 2 
The substitution x = In m throws the integral (3) into the form 


?, as et ea etre («—In a)* dx. (7) 


In the cases r = 0 and 1, this integral was evaluated by the approximate formula 


g(x) de=hS g(nh) (h>0O). (8) 
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The error involved in (8) is discussed below, and it is shown that in the application to (7) 
over the range of Tables 1 and 2 the error is negligible when 


h = }In 10 = 0-575646273. (9) 
Accordingly, to the required accuracy, ¢, is given by 


co 


>» Un@n—ks (10) 
and 1 by >» Un*n—k» (1 1) 
where k =h Ina = 4loga, 


nh A 
u,=he™, v, =e'u 


2, = aii exp [—n7h?/20°], 
and h is given by (9). 

The sums (10) and (11) were calculated by the use of punched cards, with a Hollerith 
Junior Rolling Total Tabulator. The values of u,, once punched, were repeatedly fed into 
thxe reproducing punch, and in that way a master pack was constructed in which the card 
with serial number n(= — 34 to +22) carried the values u,,_ 9, Up_7, ---, Unig The first main 
pack was obtained by gang-punching these values on to 15 sets of cards carrying the values 
of z,, for the 15 different values of o?. Eventually four main packs were constructed, in which 
the cards carried (besides c? and other referencenumbers) the following values:z,,,Un_g; -++;Un+2 
in the first pack; z,, U,+9, ---» Un4ig in the second; and similar values with u replaced by v in 
the third and fourth packs. The results were then obtained by progressive digiting. The full 
number of control-breaks required for that process was ensured by the insertion of sub- 
sidiary packs, so that the whole of each main pack could be included in a single run through 
the tabulator. 

Values of z,, correct to six decimal places, and of wu, and v, correct to five, were used, the 
data being obtained by interpolation in the New York W.P.A. tables (1939, 1942). Since Xz, 
was less than 1-8 for all relevant o?, the error due to rounding the w, and v, was less than 
0-000009. The sum of the absolute rounding errors in the z,,, restricted to n < 12 because the 
rounded w,,,, and v,,, vanish when n > 12, is easily seen to be less than 0-000025. Since the 
u,, and v,, were less than 0-58, it follows that the further error in (10) or (11) due to rounding 
of the z, was less than 0-000015. The actual rounding errors (previous to rounding of the 
final results to four decimal places) were probably less than half as much. 

The remaining source of error, the approximation (8), has already been mentioned. This 
error (for any h > 0) is in general given by Poisson’s summation formula, namely, 


n? 


i. g(x) da = h = g(nh) - 3 (07) n o(=")}, (12) 
where Qy) = r. ev 9(xr) dx. (13) 


Sufficient conditions for the validity of (12) are (Titchmarsh, 1948, Theorem 45) that, in the 
whole range —0o <x <00, g(x) be continuous and expressible as the difference between two 
bounded non-decreasing functions, and that the integral on the left-hand side of (12) converge. 














(7) 


(9) 


(10) 


(11) 
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We take g(x) to be the integrand of (7), and note that it is the product of 


9i(z) = exp(—e* + 32), 
and = : : Ina)? 
G2(x) = Te Iom %P rH 2-55 na) | 


To these functions correspond the known transforms 


Oily) = T(3 +%y), 


Qaly) = 5a HV exp [Jo%(r— 4+ iy). 


Now Qy) = 5 3 Qi(y — wv) Qo(w) dw 


(Titchmarsh, 1948, p. 51). Taking r = 0 or 1, we have 
1 : : 
|Q)| <zearFexp igo" [" exp[—do%w'] | P+ iyi) | de, 


It is shown in text-books that | P'(4+iw) | = ./(asech zw) for real w, and since sech mw < 2e-7” 
it follows that 


| Q(y)| < a a’ exp [407] .. exp [—4(o?w? + 7y —mw)] dw 


2 2 
= a’-*g-1 exp IS — ay sail . 


2 * Bo2 
A similar inequality with the sign of y changed is obtained from sech mw < 2e™, and thus 
2 7 
l rtg-l ER ET. 
| Q(y) | <a o exp (5 +g: sly}. 


Substitution in (12) shows that the error in (8), applied to ¢p or ¢,, is less than 
“ 
9nt-ta—1 Selec is Seeman 2 = 
2a’—-to exp(5 +) [exp [m?/h]—1). 
Over the range of Tables 1 and 2, with h given by (9), this quantity is less than 0-000005. 


CHECKS 


The values of z were checked by summation, using a known result of applying (12) to the 
normal curve, that &z,, differs from 1/h by nearly 2h-! exp [ — 27°o?/h?]. The values of u and 
v and the original punching, the reproducing, gang-punching and sorting, were checked by 
methods which do not call for special comment. The Hollerith results for different columns 
of z were combined on desk machines, and a final check of that process and of the Hollerith 
work was obtained by calculating independently the quantities 


+12 \ +12 
p> en( >> Uns] ? >> en( >» tna) 
n k=-8 n . \k=-8 
for all o?. This brought to light the fact that one of several cards damaged in the sorter 
had been incorrectly renewed, but eventually exact agreement was obtained. 
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SUMMARY 


The expected number of species having r representatives in a sample of an animal popula- 
tion is considered. It is assumed that the population distribution of abundance is log-normal, 
and that the sampling process results in Poisson variation of the number obtained from each 
species. Tables dealing with the cases r = 0, 1 are provided, and their construction, checking 
and accuracy discussed. 


The author is indebted to Dr C. B. Williams for suggesting this investigation, to 
Dr F. Yates for his interest and encouragement, and to several colleagues for advice on 
the Hollerith work, especially Mr G. V. Dyke. Thanks are also due to A. C. Sphinx Sparking 
Plugs Ltd for the use of their Reproducing Punch. 
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TABLES OF SYMMETRIC FUNCTIONS. PARTS II AND III 
By F. N. DAVID anp M. G. KENDALL 


1. Ina previous paper (1949) we presented tables of the augmented monomial symmetric 
functions in terms of the one-part symmetric functions and vice versa. The present paper 
presents the monomials in terms of unitary functions (and vice versa) and the unitary 
functions in terms of homogeneous product-sums. It is convenient to recall the definitions 
of these functions: 

(a) The monomial symmetrics (which we denote for short by M) are typified by 


(pipe... pet) = Lapa... cheaPe... xhead*), (1) 


where the summation takes place over all suffixes i,j,...,v which are different. The 
augmented symmetrics are defined by 


[pt ... po] = (ph... pi) m! ... 7). (2) 
(b) The unitary symmetrics are defined by 
a, = (1") = La,a;...t, (t+I+.~.. +). (3) 


We denote them for short by U. 
(c) The one-part symmetrics or power-sums are defined by 


8, = (r) = Sat. (4) 
We denote them by S. 
(d) The homogeneous product-sums are defined by 
pal tt... tht +..., (5) 
where T = (1—a,t)(1—2,t)... = 1—a,t+a,t?-.... (6) 


We denote them by H. 

Briefly, Part I of these tables presented MS-SM tables. The present set gives MU-UM 
(Tables 2) and UH-HU {Tables 3). A third paper in preparation will give MH-HM, US-SH 
and SH-HS. In all cases we proceed as far as tables of weight 12. 

2. When Part I of these tables was prepared we were unaware of some work of Kerawala 
& Hanafi (1941, 1942, 1948), who have given tables of MS (but not SM) for weights 9, 10, 11 
and 12. Sukhatme (1938) had previously given tables up to w = 8. We have compared these 
tables with the corresponding parts of our own for w = 2 to 8 and w = 12; we have not been 
able to consult the tables for w = 9 to 11. There is complete agreement up to weight 8. For 
weight 12 we find 8 points of disagreement with the table by Kerawala & Hanafi, but a re- 
check of our own figures confirms them. There is therefore reason to hope that our own 
tables are free from error. 


3. Tables 2 give U in terms of M by reading horizontally up to the main diagonal, 


.g. from Table 2.11, 
e.g. from Table M3 = 165(111) + 36(219) + 7(2717) + (2315), ") 


M is given in terms of U by reading downwards to the main diagonal, e.g. from Table 2.11, 


(2315) = — 7744, + 27a 49, — Tag, + Ag Gz. (8) 
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It should be noticed that the order of the column headings in Tables 2 is not the same as that 
of the rows. In Tables 1 we introduced the augmented symmetric functions mainly in order 
to produce units in the main diagonal and hence to incorporate both MS and SM in one 
square array. This obviously saves a lot of space, and we have achieved the same result in 
Tables 2 by a special order of the columns. The rows, like the rows and columns of Table 1, 
are in ‘alphabetical’ order, that is to say, the partitions whose highest part is p all come 
before those whose highest part is less than p, and so on. In the columns of Tables 2 the order 
is by number of parts, those with more coming in front of those with fewer; of those with the 
same number, the later are those with unitary parts, then come those with parts 2 and so on. 


4. Tables 3 give H in terms of U by reading horizontally right across the table, e.g. from 
Table 3.5, 





hah, = —a,a, + 2a,a} + aga, — 3a,a} + a}. (9) 
If we combine (5) and (6) we have 
co : = 2X 4,(—1)'#, (10) 
She 7 
r=0 
oie : 1 ‘ 
which is equivalent to Sa(—1ye = xh, t". 
Substituting ¢ for —t¢ we find se Lh,(—1)"#, (11) 
La,t” 


which is exactly of the form (10) with a and h interchanged. It follows that any relation 
expressing h’s in terms of a’s is equally valid if we interchange h and a in it; and consequently 
the same tables express U in terms of H by reading horizontally right across the table, e.g. 
corresponding to (9), we have 


4a, = —hyh, + 2hgh? +h3h, — 3h, h3 + AS. (12) 


For Tables 3, therefore, one triangular array covers both direct and inverse relations. 


UsrEs oF TABLES 2 AND 3 


5. Tables 2 and 3 have various uses in the theories of statistics and probability. We 
exemplify a few of them. 

(a) Sampling from a finite population. Isserlis (1931, p. 593) gave what is equivalent to 
MU tables up to weight 8, omitting terms in a, which he did not need in his discussion of 
sampling from a finite population. (He refers to tables up to weight 10 by Salmon, who took 
them from Cayley (1857).) Isserlis’s method of evaluating sampling moments for a finite 
population of size N is to expand the statistic under investigation in terms of the M-functions 
and then calculate mean values by the simple relation 


ni) 
E( DT" «+ Pen = Fyq (PI «+ Pa")v» (13) 


where the suffixes to the monomials denote respectively that they relate to a sample of n 
and a population of NV, n = n(n—1)...(n— +1), and wis the weight of the monomial, 
namely, &7. He then expresses the population M-functions in terms of U-functions and 
thence in terms of S-functions. By the use of Tables 1 we can proceed direct from M- to 
S-functions, a course followed by Sukhatme (1944). 
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(b) Distribution problems. MacMahon (1915) gives a number of results concerning the 
distribution of objects of given type into boxes of given type. For instance, let there be p, 
objects of one type, p, of a second, ..., p, of an sth type and m boxes. Then the number of 
ways of distributing the objects such that there are g; objects in the jth box and no type 
of object appears more than once in the same box and no box is empty is the coefficient of 
(P1Po---P,) iN Ay, 4p, ... A, multiplied by the numerical coefficient of x?1+Pst---+Psq_ ay, ... Agn 
in the development of (a,x+a,x7+...)ut+%+---+¢m, Such coefficients are obtainable from 
Tables 2. For instance, if we have four red, three white, three black balls and one green ball, 
eleven altogether, they can be allocated so that there are two balls in each of five boxes and 
one in a sixth, no pair being of the same colour, in 6 x 430 ways; 430 being from Table 2.11 
the coefficient of (4371) in a3a,, and the 6 being the coefficient of za,a} in the expansion 
of (a,%+a,x7+...)%. 

Again, the number of ways of placing units in the compartments of a lattice with m rows 
and J columns so that not more than one unit occurs in any one compartment, and such that 
there are /4;, /g, ..., 4m Units in the successive rows and A,, Ag, ..., A, in successive columns, is 
the coefficient of (A, A, ... A;) in the expansion of a a,,,, Such results have applications 
in the enumeration of experimental designs. : 

(c) Multipartite numbers. MacMahon (1915, p. 155) shows that the number of com- 
positions of the multipartite number (pqr...) is the coefficient of the symmetric function 
(pqr ...) in the development of the algebraic fraction 

1 
2{1 — 2(a, -—a,+a3—...)}° 
The number of compositions can thus be obtained by expanding the fraction and adding 
the coefficients of M in each relevant U product. 


xp eee 





CONSTRUCTION OF THE TABLES 


6. The UM half of Tables 2 was obtained by building up the table for any given weight 
from those of lower weights. For example, from Table 2.9 
Gy, = 36(1%) + 7(217) + (2715), 
and hence 
Ay Giga, = (1) {a,aq} = 36{10( 119) + (218)} + 7{8(218) + 2(2218) + 7(317)} 
+ {6(2716) + 3(2314) + (3215)} 
= 360(129) + 92(218) + 20(2218) + 7(317) + 3(2914) + (3215). 
The MU half was obtained by inverting the MU relations. 

Four checks were available: 

(a) There is a law of symmetry which asserts that the coefficient of a,, ...a@,, in (q, ... q) 
is the coefficient of a,, ...a,, in (p, ... p,) and vice versa; hence each coefficient checks against 
an independently calculated coefficient except in a few symmetrical cases such as the 
coefficient of (2*) in af. 

' (6) If we put each member entering into a symmetric function equai to unity and take 
w members, there are simple arithmetical relations for each row and column; e.g. for w = 10 
we then have a) = 1, a, = a, = 10, ag = a, = 45, a, = a, = 120, a, = a, = 210, a, = 252. 
Also (12°) = 1, (218) = 90, (21%) = 1260, (317) = 360, etc. Thus, in the row corresponding to 
74a, in Table 2.10 we check by the identity 

54000 = 360 + 92(90) + 20(1260) + 7(360) + 3(4200) + 5040. 
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Each row and column was checked in this way: 

(c) the D-operator of Hammond referred to in the Introduction to Tables 1 was used in 
the manner there described to provide a further check; 

(d) the tables were collated with those published by Cayley (1857) for weights 2-10, and 
Durfee (1882) for weight 12. Tables for weight 11 have been given by Faa de Bruno (1876), 
but we have not been able to find a copy of this work in this country. 


7. Tables 3 were much easier to calculate. Again any table for given weight was built 
up from tables for lower weights. Since a, = h, any function with a unit part is obtainable 
immediately from a previously calculated value. For non-unitary parts we have, for example, 


highs = (—@q + af) (— 4+ 2a,a, + a3— 3a,aj + a4) 
= 444, —4, a} — 2a,a,4, + 2a,a3 — a3 + 4aza? — 4a,a¢+ a. 
Three checks were used: 
(a) the sum of coefficients in any row is zero; 
(6) the relations between a and h being identical we can check by mutual orthogonality; 
e.g. in Table 3.4, 
hgh, = a,a, — 2a,a?+a} 
= (hgh, — 2h hi + hi) — 2(— hgh + hh) +h} 
= hgh; 


(c) the D-operator also gives an independent check for each relationship. 


VALUES FOR HIGHER WEIGHTS 


8. For weights higher than 12 individual values can be derived by operational processes 
as exemplified in the Introduction to Tables 1. We assume an expansion of, say, a given 
M-function of weight w in terms of the U-functions and operate by Hammond operators of 
order (w— 12), identifying the result with values given by Table 2.12. This will determine 
nearly all the coefficients and any remaining can be determined by operating by a higher- 
order operator and comparing with a Table 2 for a lower weight. 

Full MU tables for weight 13 have been given by P. A. MacMahon (1882). It is thought 
that tables of weight 14 have been computed but not published. Tables of the 15th degree 
have been calculated by J. B. Shaw and printed as one of the publications of the Carnegie 
Institute of Washington. 


We are indebted to Prof. D. Lehmer for some of the references. 
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Table 2.2 Table 2.3 
| w=2 (1*) (2) w=3 (1*) (21) (3) 
| a; Zz —2 as z —— 3 
| ae 2 | Z aa, 3 Ir -3 
| a;* 6 3 I 
Table 2.4 Table 2.5 
| 
| w=4 (1) | (21) (2*) (31) (4) | w=5 | (1) | (2x*) | (2*1) | G1") | G2) (41) (s) 
a | I -4 2 -4 | as | Z “5 5 5 =e -s 5 
aa, | 4 I -2 -1I 4 | aa, | 5 Z -3 -1 5 I -5 
a,* | 2 I -2 2 32 10 3 Z —2 -1I 5 -5§ 
@a;? | 12 5 2 I —4 a,a;" 20 7 2 I -2 -1I 5 
a,‘ 24 12 6 4 x | a,*a 30 12 5 2 Z -3 5 
| t a,a;* 60 27 12 4 3 Z at 
| a,’ 120 60 30 | 20 10 5 I 
! i 
Table 2.6 
w=6 (1) (21‘) (2*1*) (31°) (2") (321) (41°) (3") (42) (s1) (6) 
| 
a I -6 9 6 -2 —12 -6 3 6 6 -6 
sa; 6 I —4 -1 2 7 I -3 -6 —é 6 
QQ, 15 4 I -2 —2 4 2 -3 2 -6 6 
a,a;* 30 9 2 Z -3 -1I 3 2 I -6 
20 6 2 ° Z —3 3 3 <3 rs 3 
34,4, 60 22 8 3 3 Z =3 73 4 7 12 
a,a;* 120 48 18 10 6 3 I 4 -2 -1 6 
a? oO 36 15 6 6 3 . I —2 2 —2 
a,*a;" 180 78 34 18 | 15 2 2 Z —-4 9 
a,a,* 360 168 78 48 36 22 9 6 4 I -6 
a,’ 720 360 180 120 | 90 60 30 20 15 6 I 
Table 2.7 
w=7 (1’) (2x*) | (2*x*) | (3x*) | (2%x) | (321%) | (41°) | (32") | (3%x) | Gq2z) | (51%) | (43) | (52) | (61) (7) 
a, I -7 14 -7 —21 a | 7 7 I 7 =7 =— ~—7 7 
aga, 7 z =—s =e 5 9 I =F —¢ = -t 7 7 I 7 
5a, 21 5 I 2 3 6 2 3 —_ —4 2 7 = 7 77 
a,a;* 42 Ir 2 I -4 a | 2 4 3 I -7 -2 -I 7 
aa, 35 10 3 Z -3 3 a 5 —2 =—7 <9 7 7 aw 
44424, 105 35 11 4 3 I -3 -2 -1 8 3 —2 4 --8 14 
aa; 210 75 2 13 6 3 I ‘ ° —3 -!I 3 2 I -7 
@,* a, 140 o I 6 7 2 I -2 —1I 4 5 -7 -4 7 
3a, 210 t) 31 12 12 5 2 I —2 2 -1 3 -7 7 
a@;a,a;* 420 170 68 4 27 13 3 5 2 I -4 -3 6 9 —21 
a,a,* 840 360 150 8 60 34 13 12 6 4 I -2 -! 7 
a,' a, 630 270 II 60 Sr 2 6 12 Z 3 I -3 5 -7 
a,*a;* 1260 570 25 150 117 6 24 I I 11 2 3 Zz -5§ 14 
a,a,° 2520 | 1200 570 60 270 170 75 jo 50 35 Ir 10 5 I -7 
a 5040 | 2520 | 1260 40 630 420 210 210 140 105 42 35 21 7 I 
| 
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Table 2.8 

w =8 (i) oe)... 5. ee (2*1*) (31°) (2*1*) (321°) (41‘) (24) (3271) (3*1*) 
a, Z -8 20 8 —16 —32 -8 2 24 12 
aa; 8 I -6 -1 9 Ir I -2 —17 —5§ 
Aga, 28 6 Z -2 —-4 8 2 2 “ -9 
a,a;* 56 13 2 Z ° —5§ -1 ° 5 5 
asa, 56 15 Z —3 3 —3 6 3 
54,0, 168 51 14 5 3 Z —% > =% 
a,a,;* 336 108 30 16 6 3 ZI a 
a? zo 20 6 2 Z -4 2 
4430; 280 95 32 10 11 3 4 Z -2 

a,* 420 150 53 20 18 7 - 2 s 
@,a,a;" 840 315 154 55 9 18 4 12 5 2 
a,a;* 1680 6 140 4 46 17 24 12 6 
Gs" a, 5 210 oO ° 31 12 12 5 2 
a,'a,* 1120 172 C) 68 30 6 28 12 4 
,Q,* a; 1680 284 140 11 58 12 24 12 H 
@,a;,a,* 3360 1440 612 40 25 14! 46 108 5 ° 
a,a,° 6720 3000 1320 0° 570 340 140 240 140 C) 
e 2520 1080 468 240 204 108 24 28 

a,'a,* 5040 2250 1008 570 453 258 8 2 117 68 
a,*a;* 10080 4680 2172 1320 I 612 24) 4 2 172 
a,a;° 20160 9720 4680 3000 2250 o- 660 1080 ro 
a, 40320 20160 10080 6720 5040 33 1680 25206 1680 1120 } 

w=8 (ii) | (421") | (51°) (3°2) (42%) (431) (s21) (61*) (4°) (53) (62) (71) (8) 
a 8 -8 -8 —16 —16 -8 4 8 8 -8 
1a; —10 -1I 8 +8 9 9 I -—4 -8 —8 -1 8 
QQ, -6 -2 2 —4 16 4 2 —4 -8 4 8 8 
a,a;" 4 I —5 -2 -9 -3 —I 4 8 2 I -8 
sa; -9 -3 -7 8 I I 8 “¢ 7 -8 -8 8 
54,4, Ir 3 5 —4 —10 -8 -3 I 4 9 —16 
a,a,* -4 -1 7 2 ¢ 3 I “7 -3 -—2 -1 8 
a? 4 —4 4 % “~ 4 -4 ~ —4 4 ’ 
4430, -1 4 -1I 10 —10 -9 -8 I I 2 -! { 
4a, -2 2 —2 4 —4 -2 -4 8 —4 -8 
a,a,a; Z -4 ~ -2 -1 11 4 4 -9 -6 —10 24 
a,a;* 4 Z é -3 -1 . 3 2 I - 
a3", I -2 -1 5 -5 4 -7 2 8 -8 
a,'a,* 2 2 I -2 -1 5 2 3 -9 -5§ 12 
G54," a, 5 5 2 I =$ 5 -4 6 mei A 24 
a@,a,a,* I 3 12 7 3 ZI -5§ ° -3 8 Ir —32 
a,a;* 55 16 30 20 10 5 Z ° -2 —1I 8 
a, 12 ‘ 12 6 4 . Z —2 2 2 2 
a, a,* 39 6 I 18 11 3 2 I —4 9 | 136 
a,*a,* 114 jo oO 53 32 14 2 6 4 Z -6 | 20 
a,a,;* 15 108 210 150 5 51 13 20 15 6 zx} -8 
a,’ 40 336 560 420 280 168 56 7° 5 28 8 Z 
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Table 2.9 
| | 
(1) | (21”) (2*18) (31°) (2*1°) | (321°) | (41°) (2*r) | (3227) | (3*1*) | (421%) | (514) | G2%) 
| 
} 
I | -9 27 9 — | > 9 54 18 36 -9 
9 | I -7 | -1 14 3 I -7 —30 —-6 | —12 -1 9 
36 7 Z -2 -5 10 5 -5 —1Ir -8 - -5 
2 15 2 # ° -6 -1I " 9 6 5 I -2 
4 21 5 : Z = 3 rs o) 3 —I2 -% 3 
252 70 I~ 6 3 Z -3 ‘ —4 —! 14 s 2 
504 147 36 19 6 3 I : -5 =f : 
126 35 10 ; 3 I —-4 2 —4 -1I 
504 161 50 15 15 4 4 Z -—2 -1 4 -2 
75 252 81 30 24 9 6 2 Zz —2 « 
1512 525 172 81 51 2 5 12 5 2 Zz —4 ° 
3024 1092 366 204 108 5 21 24 12 6 4 r ~ 
630 210 7° 20 9 2 . I 
1260 455 165 60 60 22 22 8 3 3 
2520 945 350 155 129 54 10 19 6 3 7 
3780 1470 565 270 213 1or | 20 78 3 17 7 12 
75 3045 1200 645 459 241 75 168 5 42 25 4 27 
15120 6300 2550 1500 990 570 225 360 200 110 75 21 60 
1680 630 240 | 90 93 36 36 15 ‘ 6 
5040 2030 820 390 333 158 30 136 65 30 12 ° 27 
| 4 
10080 4200 1740 | 920 720 | 372 | 110 300 152 72 42 6 61 
7560 3150 1320 oO 555 282 | 60 | 234 120 64 27 5° 
15120 6510 2800 1530 1203 | 656 200 | 516 281 152 85 12 12 
302. 13440 5940 3480 2610 | 1516 570 | 1140 656 372 241 58 282 
60480 27720 12600 7800 5670 3480 1500 | 2520 1530 920 64 204 660 
22680 10080 4500 2520 2016 | 1140 360 | 906 51 300 16 2. 234 
45360 20790 9540 | 5670 4383 2610 990 | 2016 | 1203 720 459 1 555 
90720 840 20220 12 9540 5940 | 2550 4500 2800 1740 1200 366 1320 
181440 8200 42840 27720 20790 13440 | 6300 10080 6510 4200 3045 1092 3150 
362880 181440 90720 60480 45360 30240 | 15120 22680 15120 10080 7560 3024 7560 
| 
(3°21) | (42*x) | (4317) | (s21*) | (61°) | (3°) | (432) | (52*) | (4*x) | (534) | (621) | (71*) | (54) | (63) | (72) | (81) | (9) 
ae | aes rar [ar | 92 3 18 9 9 18 18 9} -9|-9} -9] -9 9 
19 19 II II I | -3 -18 | -9 | -5§ |-10;-10 | —-1 9 9 9 I -9 
13 = 20 6 ) 3 —4 5| ~-9 |-18 | —4]| -3 9 9| -S5 9} -9 
—12 -§ | -11 -4|-1 3 II 2 5 10 3 I -9 | -9 —-2|/-1 9 
—18 9 9 9 $1) 6 - | -9 | -9 . -9 oh) =o 9 ol - 
S| —12 —13 —-II -3 | -3 4 4 14 10 8 3 |-18 ° —-4|-I0 I 
5 5 4+ I . =$§ —2 =< —=~— ye” — 9 3 - i —-9 
7 -3 | —13 7 9 -3 2 I Ir 2\|-1 -9 |-11 9 2 9|- 
4 2 5 —15§ —4 —3 —2 8 -6 5 10 10 . | ~2 —10 I 
-—3 6 —-6 —2 3 —6 ; —t 4 2 I -9 5 -9 9 
: -3 -1I 1s 4 5 —6 t}—-15 |-11 -4 7 9 6 ir | -27 
° . -4|-!1 2 4 3 I -4| -3 -2 -1 9 
3 =—2 5 I ai. 3 “- es =< = 14 5 11 -9 -9 a 2 
I —2 -1 5 -5§ -3 8 -6 -2 -2 4 Ir 2 ° —-4\|-1 1 
2 Zz -2 -1 5 P -1 ‘ 5 § |-13 |-11 | —-13 9 20 11 | -27 
5 2 I 3 5 : -2 6| —: 2|-12 -5|-3 9 —1 19 | —27 
12 7 3 z| -5 ‘ -2 . -1 14 5 4 |-12 -8 |-12 36 | 
30 20 10 5 I ‘ . ‘ -3 -1 “ 3 2 I -9 | 
3 . . . Z -3 3 31 -Fi 3 “3 oi) <s1) 9 
13 5 2 3 Ki -3 1] =§% 4 7 |-12 7 |-18 13 19 | —27 
30 17 6 2 6 3 Z -2 -1I 6 2 3 -1 —6 18 
27 12 7 6 3 > Z —2 2 -2 -1 3 -5 9 -9 
65 | 36 19 5 15 2 2 ei <<} 9| -4 9| -5 |-30] 54 
15 101 54 23 3 36 22 9 6 4 I -6 -3 10 13 | —-45 
390 270 15 81 19 90° 60 30 20 15 6 Z ° —2 -1 
136 78 3 12 36 22 9 4 > I -3 5 -7 9 
33 213 129 51 6 93 60 | 24 | 24 15 3 e4, = 14 | —30 
20 565 350 172 36 240 165 81 7° 50 17 2 to 5 Z —-7 | +27 
2030 | 1470 945 525 | 147 630 455 | 252 | 210 | 161 7Jo 15 35 21 7 z/|-9 
5040 | 3780 | 2520 | 1512 | 504 | 1680 | 1260 | 756 | 630 | 504 | 252 72 | 126 84 36 9 zr 
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w = 10 (i) (x**) (21°) (2*1*) (31") (2*1*) (321°) (41°) (21?) (32*1*) 
ayo Z —10 35 10 —50 —60 —10 25 100 
ya, 10 Z a —1I 20 15 I —16 —46 
4; 45 8 Z -3 —6 12 2 9 —12 
aa," go I 2 Z ° -7 -1 d 14 
Q,Q3 120 2 6 . Z -3 3 -4 12 
7 QQ; 360 92 20 7 3 Z 2 : ~§ 
a,a;* 720 192 42 22 6 3 I : ‘ 
Aga, 210 56 15 ui 4 ms ; I —4 
4,430, 8 252 72 21 19 5 4 Z 
a,a;* 12 392 115 42 30 Ir 6 2 
44,4," 2520 812 242 112 63 28 6 12 5 
a,a,* 5040 1680 510 280 132 70 25 24 12 
a;* 252 7° 20 . 6 - i 2 ‘ 
54,4, 1260 rey 130 35 42 10 . 14 3 
54345 2520 68 296 105 99 35 ‘ 32 II 
a;a,a," 5 1792 622 266 210 85 15 68 26 
Gs," 75 2772 990 462 339 156 30 108 48 
54,4; 15120 5712 2082 1092 720 368 Ir 228 112 
a,a,° 30240 11760 4380 2520 1530 860 330 480 260 
aed, 3150 1120 400 140 144 50 é 53 18 
aa? 6300 2310 840 350 306 120 20 114 42 
4a 4200 1540 570 210 213 80 . 80 31 
4 030,0, 12600 4900 1900 875 735 335 60 284 128 
a@,a,a,° 25200 10080 3990 2030 1566 775 215 612 204 
a,* I = 7560 3015 1470 1194 585 120 468 228 
a,a,*a,* 37800 15540 6330 360 2547 1340 390 1008 523 
a,a,a,* 75600 31920 13290 7560 5436 3050 1095 2172 1196 
a,a,° 151200 65520 27900 16800 11610 6900 2850 4680 2730 
a;° a; 16800 6720 2700 1260 1092 10 go 444 210 
a;*a,;* 25200 10360 4280 2100 1776 80 180 740 370 
a,'a,a,;* 50400 21280 8980 4760 3792 2000 570 1604 844 
a,*a,* I 0° 43680 18840 10640 8100 4520 1580 3480 I920 
3 4,;° a; 75600 32760 14220 7770 6180 3390 1020 2088 1479 
a3 a,'a;* 151200 67200 29820 17220 13212 7610 2730 5844 3358 
@,a,a,° goagee 137760 62520 =— 28260 17000 6900 12720 7610 
a;a," 282240 131040 2320 60480 37800 16800 27720 17220 
a,* 113 50400 22500 12600 10080 5700 1800 | 4530 2580 
a,‘a,;* 226800 103320 47160 27720 21564 12720 4680 | 9876 5844 
a,°a;* 453600 211680 $8820 60480 46152 28260 11610 | 21564 13212 
a,*a,° 907200 433440 207000 131040 98820 62520 27900 | 47160 29820 
| 
a,a,° 1814400 887040 433440 282240 211680 137760 65520 | 103320 67200 
a,’* 3628800 1814400 907200 604800 453600 302400 151200 | 226800 151200 
w = 10 (ii) (3*1) (421°) (51°) (2°) (32°1) (3*21") (42*1*) (431°) (s21*) (61°) 
a1 25 50 10 —-2 —40 —60 —60 —40 — 40. —10 
Qa, —7 —14 —% 2 31 3 33 13 13 I 
QQ, -13 —10 -2 -2 -8 2 4 24 8 2 
aa," 7 6 I . -7 —21 - —13 -§ -1I 
7M 3 —15 -3 2 -2 —24 I 12 12 3 
72a; -% 17 3 : 5 9 —23 —16 —14 a9 
a,a,* ‘ -6 -1 ° ° ° 9 6 5 I 
A 2 4 —4 -2 8 -12 -8 16 4 
CAC AN —3 “9 4 : =% 6 3 5 -1 -—4 
a,a,* I -2 2 ° -4 > -2 
@,a,a;* 2 I —-4 ‘ F ‘ —4 -1 19 4 
a,a,* 6 4 I ‘ ; é ; . -5 -1 
a;* : . . Z —% 5 5 ee -5 5 
454,44; . . . 5 Z a = 5 I —§ 
G5 43, ¢ ° ° 10 3 —2 =f. 5 —5 
4, @3a; 4 . 20 7 2 Z —2 a +5 
54," a; 22 9 ; 30 12 5 2 I -3 5 
45,4," 54 32 5 60 27 12 7 3 I -5 
a,a;° 140 95 26 120 60 30 20 10 5 IZ 
a3 ds 6 > : 20 7 2 ‘ 5 4 x 
aa," 12 6 . 45 16 4 2 i : - 
a, 12 . 3 30 12 5 : 5 : 4 
4443020; 56 22 ‘ 110 49 21 8 3 ~ é 
a,a,a,;* 132 76 10 240 112 48 2 9 3 r 
a,* 115 48 . 180 8 42 I 10 ‘. ‘ 
a,a,;' a," 270 149 20 go 19 99 54 27 7 
@,a,a,* 650 416 95 40 450 236 149 76 32 4 
a,a,° 1580 1095 330 1800 1020 570 390 215 111 25 
a,' a, 96 36 ° 180 87 = 15 6 ° : 
a,*a,* 188 78 310 156 t) 34 18 
@,*a,a;" 436 236 30 680 58 186 99 48 12 ; 
a,*a,* 1032 650 140 1500 20 436 270 132 54 6 
@,a;° a, 820 450 60 1170 645 58 198 112 27 . 
@,4,*a,* 1920 1196 260 2580 1479 44 523 294 112 12 
Gy 04a; 4520 3050 860 5700 90 2000 13 775 368 ze 
@, a," I i 9 2520 12600 777° 4760 33 2030 1092 280 
a,* 1500 120 2040 1170 ) 390 240 60 2 
a,‘a;* piso 2172 480 4530 2688 1 1008 612 228 24 
a,'a;* 100 5436 1530 10080 6180 37 2 2547 1566 720 132 
a,*a,* 18840 13290 4380 22500 14220 oO 6330 3990 2082 510 
a,a,* 43680 31920 11760 50400 32760 21280 15 10080 5712 1680 
a,* 100800 75 30240 113400 75600 50400 37800 25200 15120 5040 
























































Table 2.10 (cont.) 










































































(4321) | (s2*z) (4*1*) (s31*) (621°) (71°) (43°) (4#2) 
60 30 15 30 30 to —10 —10 
—42 —21 -6 —12 —12 -1 10 10 
—28 2 11 —22 -6 -2 10 2 
26 5 12 4 I -10 —-6 
—-9 —a8s —— -9 —3 ~~ 10 
2" 12 17 13 Ir 3 I —12 
—12 - -6 - - -1I 3 
12 -1 9 - - —10 -2 —14 
—% 18 -3 15 15 4 13 
- 4 1 10 2 —4 10 
7 a I —19 —15 -4 -3 ~~ 
; 10 2 F 
- ° 10 -1 ~ 5 
P -1I 3 —12 I Ir 3 4 
5 -13 I -4 15 5 I —12 
2 2 3 —I9 a =—3 2 
9 9 =F 2 —1I7 —3 3 2 
-3 ; -1 19 5 ; é 
: ‘ - -1I é ‘ 
4 2 -3 2 - 6 2 2 
-3 ae 3 2 I -6 3 -3 
-3 3 3 -3 -3 3 -1 2 
I -3 -3 4 7 —12 -3 4 
3 Zz ° -—2 -!I 6 é e 
3 ° I -2 2 -2 ‘ -2 
2 2 Z -4 9 $ r 
22 9 6 4 Z -6 « ° 
60 30 20 15 6 Z « © 
3 ‘ Z : ‘ 5 I -2 
. 2 L 4 > 2 I 
21 5 4 2 ° ° 5 2 
56 22 = 8 2 12 6 
12 I 12 
38 48 42 26 is od ib 
35 156 120 85 3 50 
75 2 350 2 112 22 210 1 
110 30 45 20 a 3 3° 20 
108 1m 68 12 53 
735 339 = 210 63 6 213 144 
1900 990 622 242 42 570 400 
4900 2772 2310 1792 812 192 1540 1120 
12600 7560 6300 5040 2520 720 4200 3150 
(631) (721) (81°) (s*) (64) (73) (82) (91) (10) 
—20 —20 —10 5 10 10 10 10 —10 
11 II I -5 —10 —10 —10 -1 10 
20 4 2 —-5§ —10 —10 6 —10 to 
-11 -3 -1 5 10 10 2 I —10 
-1 -1I 10 -5 —10 11 —10 —-19 10 
—-10 -8 -3 10 20 -1I 4 11 —20 
4 3 I -5 —10 -3 -2 -! 10 
-4 20 10 —5 14 —10 —10 —10 10 
- —10 —11 10 -4 -1 20 Ir —20 
- —4 -2 5 -2 10 -6 10 —-10 
15 11 4 —15 -6 -9 -6 —12 30 
—4 -3 -1 5 4 3 2 I —I0 
10 10 5 10 -5 -5 —5 =—§ 5 
-7 -31 -11 —15 -4 20 20 11 —20 
—19 -3 —12 —15 20 -1 20 —20 
I 13 12 10 -6 -9 —22 —12 30 
I 12 5 10 —18 -9 2 —21 30 
—19 —14 -5 —5 16 12 8 13 —40 
4 I ° -4 -3 -2 -1 10 
4 —12 -6 5 —14 10 2 10 —10 
=2 17 6 5 9 —15 -11 -6 15 
13 I —10 5 -2 -11 10 10 -10 
-15 21 26 —5§ 12 3 —28 —42 
5 —16 -13 -5§ -8 12 ar 13 —40 
. 4 2 -5§ 10 —10 —10 10 
3 —23 = 5 —12 18 4 33 —60 
-1I 17 . + a5 —tI0 —14 5° 
-3 -1 ‘ : 2 I —10 
-7 -4 7 -5 2 11 —10 -7 10 
3 “FJ 7 5 -9 I 8g 1s 
6 9 —21 5 —24 28 33 —60 
-2 -rI 7 . 2 3 == —7 25 
-3 5 -7 -5 8 —3 - 31 —40 
Z -s 14 ° -4 12 —12 —46 100 
5 Zz -7 ° . -3 12 15 —60 
21 7 Z ‘ . -2 -1 10 
7 - > Z -2 2 -2 2 —2 
4 i = 2 Z —4 9 —16 25 
19 3 : 6 4 Z -6 20 5° 
72 20 2 20 15 6 z -8 35 
252 9 17 70 56 28 8 Z 10 
3 go 252 210 120 45 10 Z 
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Table 2.11 
w =r (i) (1) (21°) (2*17) (31°) (2*1°) (321°) (41’) (2*1*) (32*1*) 
a; I ~32 44 II -77 ae oi. 55 165 
y9; II Zz -9 -1 27 17 I --30 —65 
p42 55 I -2 -7 14 2 14 —21 
a,a;* 110 19 2 Z -8 -1I 20 
Gy a3 165 36 7 Z *% 3 =| 15 
yQ20, 495 117 3 8 3 Z —3 -6 
@,a,° 990 3 25 6 3 I 
a,a, 330 4 21 5 Zz —4 
@,a,a, 1320 372 98 28 23 6 4 Z 
@,Q, 1980 576 155 56 36 13 6 2 
a,a,a,;* 3960 1188 324 I 75 3 7 12 5 2 
a;a,* 7920 678 368 156 2 29 24 12 6 
CA 462 126 35 10 3 ‘ 
44,4, 2310 714 217 56 65 15 19 
G4Q3 42 4620 1512 483 168 148 51 42 14 
4,434; 9240 3108 1008 420 gil 123 21 88 33 
444," a; 13860 4788 1589 728 495 223 42 138 
@,a,4a,° 27720 9828 3318 1708 1041 522 154 288 139 
a; 55440 20160 3° 3920 2190 1210 455 320 
a;* a, 2772 882 280 7O 90 20 30 
54,42 6930 2394 826 280 285 95 ; 99 32 
a;a,a;" 13860 4914 1722 686 600 225 35 210 74 
asa," 9240 3276 1162 re] 411 150 > 144 53 
5434,4; 27720 10332 3808 1708 1383 611 105 492 213 
53a; 55440 21168 7938 3920 2916 1398 371 1044 484 
asa,* 41580 15876 5985 2856 2211 1053 210 792 72 
asa," a," 83160 32508 12474 6468 4665 2388 672 1680 43 
5 42; 166320 66528 25998 14448 9846 5382 1869 3564 I 
5a; 332640 136080 54180 31920 20790 12060 4830 75 4290 
a? as 11550 4200 1540 560 570 210 213 80 
a4? a2; 34650 13230 5040 2240 1920 840 140 735 316 
a,?a,? 69300 27090 10500 5110 4050 1910 490 1566 714 
4a; ay 46200 18060 7070 3220 2775 1260 210 1092 497 
43 ay 69300 27720 11095 5320 4440 2135 420 1776 854 
@,4,@,a;* 138600 56700 23100 11900 9375 4785 1295 792 1917 
@,a;,a,* 277200 115920 48090 26320 19800 10670 3535 100 4292 
@,a;° a; 207900 86940 36225 19320 15015 7995 2310 6180 3276 
a,a,* a; 415800 177660 75390 42420 31725 17730 Oo 13212 7319 
444,a,; 831600 362880 156870 92400 67050 39150 15225 28260 16320 
aa," 1653200 740880 326340 199920 141750 86100 36750 60480 36330 
a,7 a, 92400 37800 15540 7560 6420 3150 630 2664 1320 
a;*a,;* 184800 77280 32340 16800 13560 7020 1890 5700 2952 
@34 aq" a, 277200 118440 50680 27160 21720 11660 360 9324 5016 
a;*a,a,° 554400 241920 105420 59360 45900 25740 750 19980 11168 
a,*a,° T108800 493920 219240 128800 97020 56600 21700 42840 24820 
a; a,‘ quence 181440 79380 43680 34800 19260 5880 15282 8406 
G34, a; 31600 370440 165060 94920 73560 42330 14910 32784 18876 
@; a," a,‘ 1663200 756000 343140 204960 155520 92700 30330 70380 41864 
@;a,a,° 3326400 1542240 713160 440160 328860 202320 6100 151200 92700 
a,a,° 6652800 3144960 1481760 940800 695520 440160 199920 325080 204960 
a,'a, seepeee 567000 258300 151200 117900 69300 25200 53910 31800 
a,‘a,* 2494800 1156680 536760 325080 249300 151200 60480 115884 70380 
@,* a," 4989600 2358720 IImstoo | 695520 527220 328860 141750 249300 155520 
a; a,’ 9979200 4808 160 | 2315880 | 1481760 III5100 713160 326340 536760 343140 
@,Q; 19958400 9797760 4808160 | 3 144960 2358720 1542240 74 80 1156680 75 
a" 39916800 | 19958400 | 9979200 | 6652800 4989600 3326400 1663200 2494800 1663200 
1 i 
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Table 2.11 (cont.) 

w =11 (ii) (421°) (s1*) (2°1) (32°1*) (3*21°) (42*1*) (431°) (s2r‘) (61°) (32*) 
ay 66 11 -11 —110 —110 —110 -55 —-55 —11 11 
394; —16 -1I 9 7° 50 50 1 1s I -11 
QQ, —12 -2 -7 -7 47 Ir 10 2 7 
aa," Z I ; —16 —32 -14 —15 -6 -1 2 
4,43 -% -3 5 —10 —30 30 15 1s 3 -5 
yg 0, 20 3 . 9 Ir —-37 —19 ss | —3 -2 
a,a;* -7 —! ° " ° . Z I 
aa 4 -4 -3 12 -2 - ~ 20 4 3 
@; Qs; ~_ 4 ° -4 8 4 5 —23 -4 2 
a,a,* -2 2 ° Pe —5 10 af —10 -2 Py 
@;@,a;* Zz —4 ‘ ° P -5 -1 2: 4 e 
a,a,* 4 Z . ° . “ : - -1 . 
244% . : rz -5 5 5 -$ -5 5 —1 
40,2; : . 5 Z —3 -1 5 I —5 -2 
40343 5 : 10 3 Z -2 -1I 5 -§ ° 
4030; 5 . 20 7 2 I -2 -1I 5 $ 
,a,*a, II : 30 12 5 2 Z -3 5 ° 
Gy; 39 6 27 12 7 3 I -5 s 
a,a,° 115 31 120 60 30 20 10 5 I e 
a;* a; ° . Ir 2 . . . ‘ . Zz 
G54, a, : : 35 11 3 . . . 4 
@;a,a,* 10 : 75 2 3 . : 9 
a,a;* > . 5° I 7 . + . 6 
5 434;4, 35 : 170 71 29 11 4 22 
a,a,a;" 120 1s 360 160 66 37 12 4 48 
@,a;* 75 : 270 123 57 24 13 36 
asa,*a;* 231 30 570 276 133 72 33 9 7B 
52a; 639 141 1200 618 314 197 41 5 168 
a;a,* 1665 6 2520 1380 750 510 275 141 31 360 
a3 as ; ° 80 31 12 . ° . ° 12 
af a.a, 50 ° 285 121 48 18 6 lp 48 
aa, 170 20 615 272 108 60 18 6 ° - 
aa; a, ) . 430 197 89 31 12 * S 7 
a3; 170 ° 710 341 164 68 34 ° ° 136 
@,434,a;" 505 60 1530 766 325 196 go 22 . 306 
@,a;a,* 1370 275 3300 1718 52 244 98 10 
0; a; 045 120 2520 1326 607 3 203 48 ° 528 
a,a,*a;° 2475 510 5430 2973 1613 93 $27 197 20 1182 
a,a;*a,° 6225 1665 11700 6660 750 2475 1370 $38 115s. 2 
aa,’ 15225 4830 25200 14910 750 3535 18 455 58 
Qs" dy 270 Irro 55 282 117 60 . 23 
a;*a;* 780 I “- 32 156 36 : 528 
Gs" a3" ay 1450 180 O10 2162 11 62) 338 78 . 934 
@,*a,a; 3750 75° 700 4852 2682 1613 64 31 30 2112 
a;*a,* 935° 2420 18900 10880 6180 4030 2210 I 170 4780 

a,* 2 360 6720 750 2112 1182 688 168 1656 
a,a,'a;" 1380 14610 15 4852 2973 1718 618 60 750 
@;a,'a,* 16320 4290 31 1887 11168 7319 4292 1904 320 Bio6 
32; 39150 12060 69300 42330 25740 17730 10670 5382 1210 192 
a,a,° 92400 31920 151200 94920 59360 42420 26320 1 3920 43680 
a,' a, 11700 2520 14610 8700 5430 300 1200 120 6720 
a;‘a;* 28260 7560 53910 32784 19980 13212 100 3 600 15282 
a,°a,° C780 207 117900 73560 45900 31725 19800 2190 4800 
a,°a;’ I § = I 258300 165060 105420 75390 25098 S930 g2380 
a,a,' 3 Oo 13 567000 37 241920 177660 115920 528 201 181 
a; 3 1600 332640 1247400 31 554400 415800 277200 166320 $5440 415800 
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446 Tables of symmetric functions 
Table 2.11 (cont.) 
w=r11(iii)} (3*2*1) | (42%x) | (3*1*) | (4321*) | (s2%r*) | (4#x*) | (532%) | (621°) (71*) (3°2) (432*) | (52°) 

ay 66 44 22 132 66 22 4 4 11 —1I -33 —Ir 
G94; —51 —34 —12 —-72 —36 -7 -1 -1 —1I II 33 11 
yz —12 Io -19 —60 -3 —-13 —2 - -2 H -3 -—49 
aa," 24 7 12 45 2 Z 14 5 I = —15 —2 
,y 30 —20 14 —12 -1 -1 —12 —12 -3 — 9 II 
44,4; -17 16 -5 44 23 20 16 14 3 -6 —4 
a,a;* ° -7 . —21 -9 -7 -6 -5 -!I . 7 2 
Q71% —24 12 6 36 —24 6 —16 -—16 —4 11 -9 - 
7434; : -2 -9 —21 33 -3 19 19 4 2 : - 
a,a," 5 —1r0 5 ~10 10 -1I 12 2 -5 10 e 
@,a,a,;" 5 PF 9 —32 I | -19 -4 ° -7 6 
a,a,;* . ° 9 o 5 I é s -2 
as 9 —4 -7 —12 9 8 16 —14 11 -4 3 I 
44,0; 6 2 3 —12 - -5§ -6 24 5 -7 2 
Ag Q3 A, -3 6 -1I 6 -r I —4 20 5 5 —12 6 
e431 . td : 6 3 2 3 —24 —§ . 3 : 
Qe, a, “ ‘ —4 12 -1 2 —22 -5§ . 2 -6 
e450; e ° . —4 ° -!I 24 5 e » 2 
44a; . ° ‘ ; i ; -5 -1I ‘ > 5 
@,;* a, —4 -1 2 7 I -3 -6 -1 6 4 -3 -1 
25 QyQ, Z -—2 -2 4 2 -3 2 -6 6 -1I 2 " 
a,a,a;" 2 I a -3 -1 3 2 I -6 ° -1I ‘ 

a,;* 2 . Z — 3 3 ~3 =-9 3 —2 6 -6 
5.25 0,0; 8 3 3 I -3 -3 4 7 —12 : -2 6 
@,a,a,° 18 10 6 3 Z ; -2 -1I 6 . . -2 
a,a,* 15 6 6 3 . Zz —2 2 -2 . ° e 
aa, a," 34 18 15 2 2 Zz —4 9 e “ ‘ 
5,0; Z 48 36 22 9 6 4 I -6 . > ° 
a,a,* 180 120 90 60 30 20 15 6 Zz ° ‘ . 
a," ay 5 ° 2 ° A ® > ° ° Z -3 3 
af aa; 19 7 6 2 ‘ ‘ = . ° 3 Zz - 
a*a, 42 23 12 6 2 ‘ m ‘ ° 6 3 4 
Qa, a 36 12 17 5 e . e ° 7 2 ° 
G40, a, a 27 30 13 : 3 . : 12 S . 
4, 4;0,a," 1 76 69 34 8 6 3 . : 27 13 3 
@,a,a,* 338 203 156 go 3 18 12 3 ° 60 34 13 
Q,4,° a; 27 14 138 76 I 23 10 : ° sr 2g 
@a,',! 62 37 327 196 72 60 37 7 . 117 24 
4440, 1450 945 80 505 231 170 120 39 4 270 170 75 
a,a," 3360 2310 1890 1295 672 490 371 154 29 630 420 210 

a, 120 si 27 ° 6 e ° ° 27 12 . 
a;'a;* 276 138 at 15 12 6 i ° 64 ° 6 
G5," ay 506 273 276 14 34 42 18 . : 120 5 15 
Gy" aya; 1168 7 642 375 133 108 66 12 ° 282 164 37 
a,*a,* 2700 174 1500 950 422 300 210 66 6 660 41 181 
aa,‘ 934 $2 528 3 78 109 4 5 : 234 13 36 
@,a,°a,* 2162 1326 1248 766 276 272 I 27 . 555 41 123 
aa,*a,* 5016 3276 2952 1917 843 714 bo | 139 12 1320 54 372 
yy; 11660 7020 4785 23 1910 13 522 82 3150 2135 105, 
a,a,* 27160 19320 16800 11900 6468 5110 3920 1708 368 7560 5320 285 
a@,*a; 4010 2520 2400 1530 f) 61 360 60 . 1110 710 270 
a,‘a;* 9324 I 5700 3792 1880 15 1 288 - 2 1776 792 
a,*a,' 21720 15015 13560 9375 4665 4050 291 1041 15 4440 2211 
a,*a;" 50680 me 32340 23100 1247 10500 7938 3318 678 15540 T1095 5 
aa; 11 7280 sb700 325 27090 211 28 37 27720 15 ze 
a," 277200 | 207900 | 184800 | 138 3160 | 69300 | 55440 | 27720 7920 | 92400 | 69300 | 415° 


















































—— a 








me NO ANN DH 


“Nan « 


- 


OMAP OWWe + Sw Ww: 





One ay Pog Ie Ow NS Ne 


F. N. Davin anp M. G. Kenpati 


Table 2.11 (cont.) 


447 














ew 





w= 11 (iv) (43*x) | (4721) | (s32x) | (62*1) | (s4z*) | (631%) | (721") | (81°) (4°3) (s3*) (s42) | (632) 
ay —33 -33 — 66 —33 —33 —33 —33 ax 11 11 22 22 
494; 23 23 46 23 13 13 13 I -11 -11 —22 —22 
yy 33 15 30 ~2 24 24 6 2 =a5 -11 —4 -4 
a,a;* —23 —14 —28 -5§ -13 —13 —4 -1 II II 13 13 
aya —1 2r -6 33 9 9 3 I 13 —22 2 
Gy 4,0, + —34 —20 —i2 “i 33 —i1 —% 10 —-2 26 2 
@,a,° 7 14 12 5 13 5 4 I —7 ~- —13 -5 
Q1a% —23 —23 38 19 5 5 5 11 . -11 6 —22 
@7Q3Q; 27 12 —33 _ —f —15 —15 -4 _ -2 16 20 
a,a;* -5 13 —16 —4 —10 —1I0 —6 -2 -3 II -10 4 
@, aa," -4 -8 41 17 22 19 15 4 7 -9 —13 —15 
a,a,* “ a —12 -5 -6 -5 ~4 -1 : 3 6 5 
4a, 18 3 “= 3 —2 3 33 Ir —I1 4 8 8 
44,0; 3 —24 I —12 —18 —12 —6 7 —14 I 
G4Q342 -3 —-6 30 -—12 3 —2i —15 -—5 10 —17 —4 ¢ 
2_432, —- —4 —11 23 ~—- 22 19 5  j 6 I —21 
Gq" a, 4 3 —-18 16 —1 23 17 7 —9 6 14 —12 
4,420; . . 7 —22 I — 24 —19 —s . we —-6 20 
a,a;* ° “ ° 5 " 5 4 I ° 3 : -s5 
a;*a; -13 2 9 2 12 7 —23 -6 11 -4 -8 -8 
54,4, 5 -2 —18 14 Ir I -11 -13 -6 7 8 -4 
@,a,a,* 5 —% 2 ~~ —15 —— 22 13 — -3 11 3 
a,* : -6 6 6 —3 6 -9 -3 1 2 Z -17 
53420, -1 8 -7 —18 2 -11 41 12 -2 6 -1 30 
a,a,a;* . ¢ 2 2 3 —23 -6 4 -3 2 ~4 
a,a,* “ -3 -6 6 2 3 -6 ‘ 6 
a,a,;*a,* . -3 12 -1 3 —32 -9 : 3 2 —18 
@;a;,a,* ‘ a « -3 1 23 6 . a < 5 
a,a,° " ° ° ; . —4 -! A A ° ° 
aa, —3 5 —2 a -§ 7 7 9 I =—% —6 10 
a2 a2.4; -2 -1 8 3 -2 —4 -8 14 5 -6 -2 -6 
aa; . “ -3 -1 3 2 I -7 . 3 -3 I 
Gy @,' a, I —2 —3% 4 5 -7 -4 7 —% ‘ 5 -3 
@,a3a, 2 Z -2 2 2 3 =F 7 —_ 6 2 —12 
@4434,a;* 5 2 Z —4 | 6 9 —21 : ~s 4 6 
@,a,a;* 12 6 4 Z = -2 -1 7 4 . -1 
@,a,' a, 12 H 3 4 I -3 5 -7 -2 6 
aa," a,* I 1 11 2 3 Zz -5§ 14 ¢ -2 
@, 4,4, io 52 35 11 10 5 Zz “Fe . 
@,a;' 210 140 105 42 35 21 7 Z ° 
a;* dy 7 3 ° ° . . . Z -2 -1 5 
a,*a,* 1 3 4 . a ‘ 2 I -2 -1I 
3° a," ay g I 2 > ° % 5 2 I -3 
a3" aa; Bo 4 29 5 6 2 . > 12 7 3 Z 
a;*a,* 220 130 go 27 20 10 2 3 30 20 10 5 
3a, 78 48 22 > 9 a . ‘ 12 6 4 : 
@,a,*a;* 197 121 71 12 25 7 > - I 18 11 3 
@,a,*a;* 497 16 213 60 74 33 5 : ‘Oo 53 32 14 
34,2; 12 40 611 22 oag 123 3 3 210 150 95 51 
a,a,° 3220 2240 1708 72 68 420 14 25 560 420 280 - 168 
a,*a 430 285 170 30 75 20 e . 80 50 35 10 
@,* a," 1092 735 492 138 aes 88 12 ; 213 144 9 42 
a 27 1920 I 5 00 311 75 570 411 I 
a a,’ a 5 3808 198 1722 1008 3 48 1540 1162 828 3 
aa; 18060 13230 10332 478 I 3108 118 243 4200 3276 2394 1512 
a," 46200 | 34650 | 27720 | 13860 | 13860 9240 3960 990 | II550 9240 6930 4620 
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w=11 (v) (72°) (s*1) (641) (731) (821) (91*) (65) (74) (83) (92) (10, 1) (11) 
ay, 11 I 22 22 22 11 —11 -11 —11 -11 -11 11 
yg, —11 -6 —12 —12 —12 -1 Ir 11 II ir 1) -«¥ 
5a, 7 -11 —22 —22 -—4 -2 Ir II Ir -7 Ir —II 
a,a,* 2 6 12 12 3 1 —t1 —11 —1I —2 —-1 | Ir 
4g Qy -11 -11 —22 2 2 -11 II II —13 Ir II —I1 
4g 4,0, 4 17 34 10 8 3 —22 —22 2 —4 —12 22 
Q,4; -2 -—6 —12 -4 -—3 —1! Ir II 3 2 1 -11 
aa, 3 -11 6 —22 —11 11 -17 Ir II 11 -11 
@;@,a, 17 6 3 10 12 —22 6 2 —22 —12 22 
a;a,* ° Ir 8 8 4 2 —T1 3 —-I1 y -11 Ir 
@,a,a," -6 —23 —18 —I§ = —4 33 5 9 6 13 —33 
a,a,;* 2 6 5 4 3 I 11 —4 -3 -2 -!I II 
aga -11 19 8 —2z2 —22 -11 -19 11 Ir 11 Ir —Ir 
4,4,a; 8 -1 6 34 12 6 —22 —22 —12 22 
44430, 4 ~ 14 20 2 13 8 —22 2 —4 —22 22 
4,434," —10 7 —12 oo —13 = 3 5 9 24 13 —33 
QeQy* a, -4 2 —24 —18 —12 = 3 19 9 a 23 —33 
a,a;,a,° 8 -1 24 19 14 5 —114 —16 —12 - —14 44 
a,a,* —2 . “5 —4 a —3 5 4 3 2 I "ihn 
a;* a, II -9 —133 17 17 6 19 —11 — 2 Ir -6 | Ir 
| 

454442 —10 —8 -1 16 26 13 8 6 —22 —4 —22 | 22 
a;a,a,* —10 12 I —18 =< — 5S —27 5 33 24 13 ag 
@;a;* iI —4 7 —2 -2 II 4 —1I1 13 -11 -I1 II 
;434,a, —16 9 6 —33 —20 —28 —24 38 +6 30 46 — 66 
@;a3a;* 12 -6 -6 I 16 14+ 16 —16 -—12 —26 —14 44 
a,a,* ; -1 2 - -4 -2 I -3 11 -7 II —11 
a;a,*a,* 10 I -3 33 23 9 9 —24 —18 —3 —36 66 
a@sa,a,* —10 : 1 —23 —17 -6 -5 20 15 10 15 —§5 
a,a,* 2 " ° 4 3 I —4 -3 -2 -1 II 
a, a; -3 II -6 —18 10 II —11 17 I —11 —11 11 
a a,a, 13 2 6 12 —34 —14 5 —23 21 15 23 —33 
a,*a,* -1 -3 —§ -3 20 7 8 —-—18 -13 -7 22 
a,a;* a, +~% — 3 27 = —23 18 —23 a 33 23 $3 
14,0," | 10 -3 6 : -6 ~-— 3 ~9 9 —3 33 —33 
44,4;,4, —10 7 —12 —21 45 —12 36 —12 —60 —72 132 
a,a,a,* . . 5 5 =a - ss -5 -8 15 28 15 —$6 
a,a;' a, -—10 -1 2 -2 16 7 —-4 12 —20 10 —34 44 
aa," a? 10 > -1 4 -37 —14 5 —16 30 Ir 50 —110 
@,a,a, -2 -1 20 7 4 —18 —12 —16 

a,a,’ A -3 -1 3 2 I —1I1 
a;*a -—5§ 4 -7 2 8 -8 —4 Ir —13 5 Ir -1I1 
a;* a, 5 2 3 -9 -5 12 -7 6 14 —19 —12 22 
@;* a," a, 5 -4 6 -17 24 9 —24 30 —12 —51 66 
a;*a,a,* -5§ A -3 8 Ir —32 5 —2 —30 47 50 —1I1r0 
a;* a, Z -2 —1 8 2 3 —1I5 -8 33 
Qa; ° I -2 2 -2 2 -1 3 -§ 7 —11 II 
a3a,°a,* ° 2 I —4 9 -16 | -5 12 —10 -7 7° —110 
a, a,' a,‘ 2 6 4 I -6 20 | —4 15 —21 —65 165 
@3a,a,° 13 20 15 6 r —8 é -3 14 17 -77 
a,a;* 56 7° 56 28 8 I é -2 —1I II 
a, a; . 11 5 : 4 ‘| I —3 5 -7 9 —1I1 
a,‘a,;* 6 30 19 4 : | 3 I — 14 —3jo 55 
a;a,° 36 90 65 2 3 | 10 5 I -7 27 —77 
a,* a,’ 155 | 280 217 9 = 2 35 2r 7 Zz -9 44 
@,4, $76 | 882 714 372 117 | 19 | 126 84 36 9 I —11 
a,' 1980 | 2772 2310 1320 495 | 110, 462 330 165 55 11 Z 

i ! 
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Table 2.12 
w =12 (i) (1) (21°) (21°) (31°) | (2*1°) 321") (41°) (2*1‘) (32*1°) (3*1°) 
aye I —12 54 12 —112 —96 -12 105 252 42 
44; 12 I —10 -1 38 19 I —50 —87 -9 
4942 66 10 I -2 - 16 2 20 —32 -17 
aya," 132 21 2 ZI : -9 —!I : 2 9 
4, as 220 45 8 ; I -3 3 —6 I 3 
Gy Gz Q; 660 145 26 2 3 I -3 ‘ -7 —% 
@,a,* 1320 300 s4 2 6 3 I ‘ ‘ - 
AQ Ss 120 2 ‘ 6 “ pi I -4 a 
G3; 1980 25 128 36 27 7 é Z -2 
a," 2970 10 201 72 42 15 2 Z 
@,a,a,;" ) 1665 418 ~ 87 38 8 12 5 2 
@,a,* 11880 3420 870 46 180 94 33 24 12 6 
a, 792 210 56 ‘ 1s @ : 4 ° . 
74,4, 3960 1170 336 84 93 21 : 24 5 ‘ 
G1 A,y, 7920 2460 736 252 207 7° ° 52 17 6 
@7 4301 15840 5040 1528 624 432 168 28 108 40 12 
@7Q, by | 23760 7740 2392 1080 681 302 56 168 72 32 
G1 020 47520 15840 4968 2520 1422 703 204 348 166 78 
@1 21 95040 32400 10320 5760 2970 1620 600 720 380 200 
924 252 7° ° 20 e 6 . 5 
46454; 5544 1722 532 126 16 35 < 52 10 . 
GGG, 13860 4620 1526 504 49 161 a 160 5° 15 
464, 27720 9450 3164 1218 1041 378 56 336 115 30 
Geas* 18480 6300 2128 756 707 252 i 228 81 30 
43,0, 5440 19740 6888 3024 2337 1008 168 760 320 132 
gs a,° 110880 40320 14280 6888 4890 2289 588 1596 722 306 
a. . 83160 30240 10752 5040 3696 1722 336 1206 552 262 
43° a; 166320 61740 22288 11340 7737 3878 1064 2532 1241 608 
44,0," 332640 126000 46200 25200 16200 8680 2940 5316 2780 1440 
G,a;° 665280 257040 95760 55440 33930 19320 7560 11160 6210 3440 
6 a, 16632 5670 1932 630 660 210 d 228 7° 20 
@s* a; 332604 11592 4004 1si2 1380 490 7° 480 160 40 
s 443 27720 9870 3528 1260 1266 455 ; 456 165 60 
9544420, 83160 30870 11396 4914 4188 1771 280 1536 635 250 
524A 166320 63000 23604 11088 8766 3983 966 3240 1420 57° 
@,a;°a 110880 pe 15848 7056 5952 2632 420 2220 976 420 
CAA 166320 4.260 24696 11592 9417 4410 840 3552 1652 776 
Qs 34,4," 332640 131040 51128 257 19722 9793 2548 7500 a 1762 
@,a3a,* 665280 267120 105840 5 41310 21644 6888 15840 132 4056 
@5a,° a, 498960 200340 79632 41580 31221 16212 4536 12012 6177 3192 
@,a,*a,* 997920 408240 164808 90720 65412 35658 11844 25380 13656 7308 
254201 1995840 831600 341040 196560 137070 78120 2 53640 30130 16800 
ar 3991680 1693440 705600 423360 aes C) 170520 70560 113400 66360 38640 
a) 34650 12 4620 1680 1710 630 . 39 240 go 
Gq' 30 138600 53550 20720 9240 8040 3570 560 3132 1390 600 
>, San . 207 81900 32270 15120 12720 5950 1120 5022 2340 1ogo 
Gy Gay 435 00 166950 66780 33390 26640 13160 3360 10620 5180 60 
a,*a,* 31600 340200 138180 73080 55800 28980 9030 22464 11440 5620 
yas, 277200 111300 44800 21420 18075 8680 1680 7308 3525 1710 
445" a; 554400 226800 92680 47040 37860 19110 4900 15468 7780 3540 
443," a, 831600 346500 144200 75600 59925 31360 8680 24864 12980 6760 
4 43a,a,° 1663200 705600 298200 163800 125550 684905 22260 52668 28530 15270 
@,a3a;° 3326400 1436400 616560 352800 263070 149100 54600 111600 62600 34 
a,4,* 1247400 529200 224280 120960 94860 51240 15120 40002 21600 11760 
@,a,° a," 2494800 | 1077300 463680 260820 198765 III510 37800 84780 47385 26460 
@,a,"a;* 4989600 | 2192400 958440 559440 416520 241920 91140 179748 103780 59640 
@,a,a,° 9979200 4460400 1980720 1194480 872910 523320 214200 381240 220950 134400 
aa,° 19958400 9072000 4092480 2540160 1829520 1128960 493920 808920 495600 30: 
a;* segeee 151200 62160 30240 25680 12600 2520 10656 5280 2640 
@;° aa, 1108800 470409 199920 105840 85140 45150 12600 36336 19320 10260 
a,*a;* 2217600 957600 413280 228480 178380 98280 31920 77040 42360 23 
a;*a,* 1663200 718200 310800 168840 134760 73500 21840 58536 32040 17 
a," a,*a,* 3326400 1461600 642320 362880 shuste 159460 54040 124176 70120 3 520 
G3" a,a, 6652800 2973600 1327200 776160 591660 344960 129360 263520 153220 560 
a," a, 13305600 048000 | 2741760 | 1653120} 1239840 744240 302400 559440 334320 198400 
yy a; 4989600 | 2230200 997920 574560 446940 257880 90720 200328 115800 7200 
@,a,°a,° 9979200 | 453 20613 1224720 936540 556290 214200 425376 252600 149940 
@3a,'a;° 19958400 223200 | 4257120 | 2600640 1962540 1197000 495600 3 550240 334320 
303; 39916800 | 18748800 8789760 pent * od 4112640 2569560 1128960 1920240 1197900 744240 
a,a,° 79833600 | 38102400 | 18144000 | 11612160 | 8618400 | 5503080 | 2540160 | 4082400 | 2600640 1653120 
a,° 7484400 3402000 1549800 907200 707400 47 5800 1§1200 323460 190800 113400 
a,5a;* 14968800 Tessas 3200400 1927800 1482300 94600 qszece 687240 415500 252000 
a,‘a,* 29937600 14061 6607440 082400 | 3106080 | 1920240 08920 | 1460736 903 559440 
a,°a;° 59875200 | 28576800 1363 ago 8618400 : 508620 | 4112640 | 1829520 | 3106080 | 1962540 | 1239840 
a,*a,° 11975 58060800 | 2 1433 18144000 13638240 87897 4092480 | 6607440 | 4257120 | 2741760 
a,a,"° 239500800 |117936000 | 58 oo | 38102400 | 28576800 | 18748800 | 9072000 | 14061600 | 9223200 6048000 
a," 479001600 |239500800 [119750400 seasguee 59875200 | 39916800 | 19958400 | 29937600 | 19955400 13305600 
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w=12(ii) | (421%) (51”) (2*1*) (32%1*) | (3*2x*) | (42"1*) (431°) (521°) (61°) (2°) (32*1) 
aA: 12 —36 —240 — 180 — 180 —72 —72 —12 2 60 
4a, ™~“ 3 25 130 7o 7oO 17 17 I —2 —49 
4, - 4 -2 —16 . 7o 20 32 12 2 2 20 
aa," I ‘ —30 —45 —20 -1 _ -1I < 9 
@,a; -21 -3 9 —21 —36 45 I I 3 -2 -6 
Gy, a,a, 2 3 ° 14 13 — 54 —22 —20 -—3 ° Se 
a,a,* - -1I “ . 20 8 7 I 4 
yy 4 —4 —4 16 =? —20 -8 24 4 2 —4 
Gy 3a, —1 4 . -5 10 5 5 —327 ag 5 
@,a,* -2 2 ; -6 12 < —12 -2 > 
@,a,a,* Zz 4 e é ‘ -6 -1I 27 4 ° 
a@,a;* 4 I e . < . P -7 -1 » 
a,a, ‘ P Z -5 5 5 -5 -5 5 —2 to 
@744a, P ° 5 I -3 -1I 5 I -§ ; -3 
;@3Q, ° : 10 3 I —2 -1 5 -5 4 » 
@,a,a;" 6 . 20 7 2 Z —2 -1I 5 ‘ 4 
@,a,"a; 13 ‘ 30 12 5 2 I -3 5 e » 
a@,a,a,* 46 7 60 27 12 7 3 Z -—5 ° . 
a@,a,° 135 36 120 60 30 20 Io 5 Z é . 
- . . 2 ‘ é ts ‘ - : Z —6 
26454; . ‘ 17 3 J ‘ ’ ? 4 6 Zz 
44444, Fa 50 15 4 . : ° ‘ 15 4 
@,a,a,* 15 é 105 34 8 4 « . e 30 9 
a,a,;* . < 7° 24 9 ‘ . y ° 20 6 
9¢434,a, 51 7 230 9. 7 14 5 i 60 22 
@,a;,a;° 174 21 480 20 4 47 15 5 n 120 48 
a,a;* 108 ; 360 159 72 30 16 A a 90 36 
4,4," a," 331 42 750 354 167 90 43 11 : 180 78 
@,a,a,* gI0 196 1560 786 392 245 120 50 6 360 168 
2, 2355 672 3240 1740 930 630 335 171 37 720 360 
a," a, e 7 81 24 6 ‘ ‘ 4 . 30 9 
a,;*a,* 20 . 172 54 12 6 a ‘ 4 66 20 
454,43 ‘ 165 60 22 4 - ‘ 60 22 
54,42, 95 ° 565 228 86 32 Io r e 210 83 
@,a,a,* 320 35 1200 507 192 106 30 10 : 450 18 
5a" a, 150 F 820 357 154 53 20 ; i 300 12 
53a, 315 . 1320 606 279 114 55 . is 480 216 
@;@,a,a;" 926 105 2800 1344 632 27 145 35 ° 1020 476 
4,a;a;* 2489 476 5940 297 1442 72 390 155 15 2160 1048 
4, a,* a; 1713 210 4500 2277 1153 618 321 75 ‘ 1620 798 
@,a,*a,* 4446 882 9540 5037 2640 1593 828 306 30 3420 1752 
@,a,a,° 11085 2856 20220 11130 6070 3970 2135 985 171 7200 840 
@;a," 26880 8232 42840 24570 14000 9660 5460 2856 672 15120 400 
a? ° ° 240 93 36 ° > ‘ : 90 36 
a) asa, 210 ‘ 1225 546 238 80 30 > A 480 216 
aa," 440 . 1990 924 424 172 80 . 795 68 
a, a,a,;* 1280 140 4245 2046 954 8 210 5° s 1710 17 
aa; 3420 630 9060 4524 2160 1294 560 220 20 3690 1812 
4,45" ay 690 ° 2960 1434 699 284 140 - 1200 58. 
@,a,* a," 1950 210 6320 3174 1570 781 360 80 2580 129 
44434," a; 3575 420 10300 5361 2779 1466 755 170 R 4260 2210 
@,4,a,a,* QgIIO 1715 22020 11841 6284 3711 1905 675 60 9180 4900 
@,a,a,° 22425 5460 47100 26130 14250 9140 4800 2135 335 19800 10860 
aa, 6420 40 16800 54 4900 2694 1500 360 . 7020 Ld 
aaa," 15075 3150 35940 19986 11085 6678 3705 1305 120 I5120 352 
44a,* a," 38670 9660 76920 44088 25120 16201 9140 3970 630 32580 18504 
444,a, 91725 26880 164700 97200 56990 38670 22425 11085 2355 70200 40980 
a,a,* 214200 70560 352800 | 214200 129360 QII140 54600 29400 7560 151200 90720 
a,‘ 1080 ‘ 4440 2220 1128 468 240 . e 1860 936 
@;* a,a, 5400 630 15540 8289 4428 2328 1230 270 6660 3564 
a;*a,* 13590 2520 33300 18300 9972 5820 3060 1050 90 14400 7920 
a;*a,* 9630 1260 25470 13986 7728 4248 2390 370 . II100 6114 
a" a," a," 23720 4620 54640 30858 17404 10428 5840 2020 180 24060 13592 
G5" 4,4; 56990 I fs) 117300 68040 32244 25120 14250 6070 930 52200 30220 
a;* a, 13 38640 Pod 149940 560 59640 34600 16800 3440 113400 67200 
3a,‘ a; 40980 8400 9850 52020 30220 I 504 10860 3840 360 320 23376 
@,a;'a,* 97200 24570 193140 | 114657 68040 4408 26130 11x30 1740 7660 52020 
a@,a,*a,° 226950 | 66360 $3500 252600 153220 103780 2600 30130 6210 190800 115800 
34,4," 523320 170520 94600 | 556290 3449) 241920 149100 78120 19320 415800 257880 
a,a,* 1194480 423360 1927800 | 1224720 776160 559440 352800 196560 55440 907200 574560 
a,’ 70200 15120 148140 87660 52200 32580 19800 7200 720 67950 0320 
a,'a,;? 164700 42840 318930 | 193140 117300 76920 | 47100 20220 3240 148140 9850 
a,‘a,‘ | meme 113400 687240 | 425376 263520 179748 111600 53640 11160 323460 200328 
a,'a,* 2910 | 287280 1482300 | 930540 591660 416520 263070 | 137070 33930 707400 446940 
a,*a,* | 1980720 | 705600 3200400 | 2061360 | 1327200 958440 616560 | 341040 95760 | 1549800 | 997920 
a,a,'° | 4460400 | 1693440 S917400 | 4536000 | 2973600 a 1436400 | 83 1600 357040 | 3402000 —— 
a,'* | 9979200 | 3991680 | 14968800 | 9979200 | 6652800 | 4989600 | 3326400 1995840 | 665280 | 7484400 | 4989600 
| | 
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w =12 (iii) | Gta? | (42*1*) 
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(31°) | (432n%) | (s2%x*).| (4*r*) | (s3x*) | (G2x*) | (71°) | (3*2*) | G42) | (*2n) 
as 180 120 40 240 120 30 60 60 12 —24 -—12 —48 
44, —114 —76 —18 —108 — 54 -8 —16 —16 -1 24 33 
- 10 —30 — 100 —10 -1 —3o —10 —2 - - 
oa = 16 38 68 14 3 16 6 I -9 -2 -27 
aya; 63 —-39 17 33 —30 —2r —15 —15 = —— 12 —42 
heed BSS es EE eg ee i fe 
a . ~ ° - _ = = pa a4 . . 
=a —28 24 8 48 —40 6 —20 —20 —4 8 —12 16 
430, = ~—— oe -27 52 -3 23 23 4 =—— : 13 
aa," 9 —18 —12 18 —1I 14 10 2 —2 4 12 
@0,a;" Lie ‘ Il eo I ah % a ‘ -2 . 
a, . . ° : . . . . 
PP a =—3S —< 5 20 5 10 —25 —5§ —1Ir 12 13 
@,4,Q, 9 3 3 -17 —4 5 —- 29 5 3 : —18 
7 Q3Q2 —4 -: -1 Z = : “’ = ; 2 “=. 7 
@7a3a e - : - - Pe e 
a,0,°4, é ° ‘ -5 15 —-1 2 —27 -5 p ° . 
7424," ° ° . -§ ‘ -1 3 5 é é ° 
a . . . . : . . a -_ . . . 
a 9 6 -2 —12 -6 3 6 6 -6 6 -6 -12 
a: - —!% 2 7 I -3 -6 —1% 6 -1 ° 7 
aaa, z —2 -2 4 2 -3 2 —6 6 -2 4 4 
aa efic-sh ed Opp ecape 3] e-a)e--al- sh- Th Rp Aas 
QQ; ° - - - 4 A 
4443424, 8 3 3 I = =3 4. 7 —12 . : 
@,a;a;° 18 10 6 3 I : -2 -1 6 ° ° 
a,a;* ss 15 6 6 3 ; I -2 2 -2 4 . 
os *TOEe lake EP ee | es ee Jiows 
42a: = . . 
« a? R 180 120 go 60 30 20 15 6 Z ° 
a;* a, 2 ° ° ° - ; r . Z -2 -3 
a;*a;" 4 2 - Fy . 3 z ‘ $ I e 
45 4,a3 7 a 3 : " : . 2 t : 
5 4,a,a ‘ . ® ° 
get j 66 36 8 9 3 " . ‘ = 13 6 
nh 36 pod Uhm ib ‘ 4 31 12 13 
ds Az @:; 9 ° ‘ 
SM ee ee es ae ee a are 
4, aa: 3 
@,0,° a, 390 207 189 103 24 30 13 ° 117 60 51 
a; a," a; 886 528 264 96 78 48 9 . 258 150 117 
asa,a;° 2020 1305 1050 675 306 220 155 5° 5 570 360 270 
a;a,;’ 4620 3150 2520 1715 882 630 476 196 36 — 840 630 
a? 15 ° ° 3 ° . ° ° ° 
SS oe ee ee rh a ee ae ee 
tata! 37° 189 = 78 18 ~ | 6 ; 153 76 66 
2a,* 201 I 
> at a, 286 « te "A . . +4 ™ ° as 48 5 
a,a;*a,;* 644 314 318 147 31 24 12 264 126 132 
s 82 ° 68 80 4 ° ° 467 244 237 
a poe Rs ssos = 264 204 m4 22 \ 1056 610 540 
4443; 5840 3705 3060 1905 828 560 3 120 10 2390 1500 1230 
a,a,* 2026 1116 1080 610 150 201 8 » ‘ 28 456 438 
a,a;2a;" 4592 2754 2505 1506 528 498 291 48 . 1875 1116 1008 
a,a;*a,* 10428 6678 5820 3711 1593 1294 872 245 20 4248 2694 2328 
a,a,4,° 23720 | 15975 | 13590 gI10 4446 3420 9 gio 135 9630 6420 5400 
aa, 5 37800 | 31920 | 22260 11844 9030 6888 2940 600 | 21840 15120 12600 
a;* t) 204 256 108 a 24 ° ° . 204 108 
@,° tat, 1914 1008 1038 540 117 144 60 828 438 451 
3 250 2 1332 60 216 36 P 188. 1080 1038 
el D 6 sors “3st | 258 Po 150 . i 1486 828 828 
a;*a;*a,* 7672 4592 4332 2578 886 8 488 78 ‘ 3386 2026 1914 
a,*a,a;* 17404 11085 9972 6284 2640 2160 1442 3902 30 7728 4900 4428 
a,*a,° 39520 26460 23040 15270 7308 5620 4056 1440 200 17660 11760 10260 
G34‘ a; 13592 8352 920 4 1752 1812 1048 168 > | 3768 356s 
@,a,°a;° 30858 | 1 18300 11841 5037 4524 2976 786 60 | 1398 9054 289 
a3a,*a,° 70120 47385 42360 28530 13656 11440 8132 2780 380 32040 21600 19320 
34,4; 159460 | 111510 | 98280 68495 35658 28980 | 21644 8680 1620 73500 51240 | 45150 
aa,’ 362880 | 260820 | 228480 163 90720 73080 56448 25200 5760 | 168840 | 120960 | 105840 
$ 24060 15120 14400 9180 3420 3690 2160 360 . II100 7020 6660 
a,‘a,;* 54640 354° 33300 22020 9540 9060 5940 1560 120 25470 16800 15540 
a,‘a;* 124176 780 7 | §2668 25380 — 15840 33 16 720 58536 40002 3 336 
a,*a,° 282360 19865 17b380 | 125550 65412 55800 41310 16200 2970 | 134760 94860 5140 
a,*a,* om ad 463680 | 413280 298200 7 08 | 138180 | 105 46200 10320 | 310800 | 224280 | 199920 
a,a,"* 1461600 |1077300 | 957600 | 705600 240 | 340200 | 267120 | 126000 32400 | 718200 | 529200 | 470400 
a,* 332! lpenaieo 2217600 aba 997920 31600 665280 | 332640 95040 (" 663200 |1247400 | 1108800 
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w = 12 (iv)| (432%1) | (522) | (43%1*) | (4*21*) | (s321*) | (62*2*) | (s4n*) | (631°) | (721*) | (814) (3*) (4372) | (4?2*) 
a3 —144 —48 -72 —72 +s —72 —48 —48 -48  -12 3 36 18 
14, III 37 39 39 7 39 I I 15 1 “3 —36 —18 
19 Ae 24 —12 62 32 2 2 2 2 -3 —16 2 
192; —51 —7 —39 — 24 - a —15 —15 —5 —1 3 26 8 
ya 9 21 -9 45 9 I 39 12 12 3 6 18 —18 
Gy Q2Q; —36 —16 —20 —62 —43 —23 —43 —16 —14 -3 -3 -2 16 
a,a;* 12 24 21 9 15 6 X -8 -8 
Ay -! 16 —40 —40 48 24 16 16 16 4 —3 -4 22 
2_43.0, 16 —26 3 16 — 63 = —22 —19 —19 -4 -3 —14 -4 
24 -4 = 16 —32 -10 —12 —12 —8 -—2 3 —14 
44,4," -17 23 —§ —10 75 32 26 23 19 4 8 8 
aya, — —21 -9 <7 -6 —§% =s 
aQ,as 4 —22 2 2 —38 37 —22 13 13 12 “3 a | —18 
Q,4,a, 3 3 15 15 21 —42 II —24 —24 ~§ s re 
G14, a, —19 19 —4 7 39 ~a 3 —26 —20 -5 -3 5 16 
@,Q3a; . ~3 -9 a —1I5s 42 <3 2 24 5 2 —4 
a, a," a 5 —15 5 4 —23 33 ~% 2 22 5 =§ 72 
@,Q,Q; 5 : 9 —4!1 I —29 —24 =§ ‘ 
a,a,° . 6 5 I ‘ 
8 12 18 —18 —12 —12 24 6 3 9 
45a; -1 -1 —20 I 7 2 19 14 —28 —13 -3 I ‘ 
4440, -8 —4 2 8 —16 22 ¢ 4 —24 —6 —3 8 —12 \ 
44,4," 6 2 3 "a = 7 om =e 29 6 : 7 8 
aa," 9 -9 -9 9 9 ~% —12 —3 3 -9 . | 
4,4,4,a, == 9 =% 11 =—55 —25 2 -F 53 12 5 —4 
4434, ‘ —3 . 6 3 2 3 ie | —-6 x > 
-4 8 - 2 2 | 
@,a,"a,* . ™) 16 at 3 —41 ~-9 
44,4, : “a =< 29 6 
@,a,° ey a 
Gs" a, 6 2 3 a —4 —2 7 ~3 7 =9 3 -9 8 
a,*a;* -4 —1 2 4 3 I -79 —2 -1 7 4 4 -—4 
G5, 4, ag 3 =—% 5 =2 —- is ’ H -7 -3 II -—4 
, 444, 4; Z -3 -2 at | 8 3 —2 —4 - 14 -1I 
454, a, 3 Z ° ° =<9 a. 3 2 I -7 
4,4," a, 2 . I —2 ne 4+ 5 7 —4 7 -2 4 
@; 43a," 5 2 Z —2 2 -1 3 -7 -2 
Gs a; 4,a,* 13 3 5 2 I =4 -3 6 9 —21 
@,a;a,* 34 13 12 6 + Z —2 =2 7 
45a," ay 27 12 7 3 . Z =3 5 a7 
a, a,°a,? 68 24 1 18 Ir 2 3 I -5 14 4 : 
44,4, 170 75 C) 50 35 II 10 Z —7 : ° 
3a, 420 210 210 140 105 42 35 21 7 I 4 : 
a? ° ° . . . ° Z -4 2 
@,' a3 a; 5 2 4 ZI -2 
a,' a," 12 + 2 6 2 I 
a,'a,a,* I 7 10 4 2 12 5 2 , 
aa; oO 30 24 12 8 2 24 12 6 ( 
Gy" a, 24 ° 12 5 . ° F ° ° ° 12 5 2 | 
aa," a,;* 60 12 29 10 5 ° é " ; i 28 12 4 
ya,a,* a, 123 27 60 31 13 ° 3 » . . 48 24 12 
4,4, 4,4; 305 103 147 78 47 8 9 3 ‘ ; 108 58 ° 
@,a,a,* 755 321 360 210 145 43 30 15 3 . 240 140 ‘0 
a, 244 126 7° 34 13 go 28 | 
@,a,' a," 599 2 314 I 2 110 18 36 10 2 117 68 
a,a;*a,* 1466 61 781 48 327 go 106 47 H asd 284 172 } 
44,4; 5 1713 1950 12850 92 331 320 I 4 4 1060 [oe] 440 
aa, 888 4536 4900 3360 2548 1064 966 388 204 33 2520 1680 1120 
a,‘ 28 12 ° ‘ 24 12 6 { 
@,* a4, 237 51 132 66 27 6 108 58 30 
a,'a,* 582 189 318 162 96 15 18 6 256 141 72 
*a,* 4 117 2 153 68 24 204 117 68 
@,* a," a," 1140 3 6 376 216 34 66 18 480 286 168 
@,* a4; 2779 1153 1§70 954 632 16 192 12 1128 699 424 
a,*a,* 6700 | 3192 | 3840 bo | 1762 60 570 306 78 6| 2 1710 | 1090 
@,a* a, 2210 798 1296 17 476 78 184 936 584 368 
4 a,°a;* 5361 | 2277 | 3174 | 2046 | 1344 354 507 208 27 2220 | 1434 924 
a, a,*a,* 12980 6177 7780 5180 3670 1241 1420 722 166 12 5280 p25 2340 
4, 44a, 31360 | 16212 | 19110 | 13160 9793 3878 ‘| 228 703 94 12600 3 80 5950 \ 
@,a,° 75 41580 | 47040 33390 25704 11340 1108) 688: 2520 46 30240 21420 15120 
a,* 4260 1620 2580 1710 1020 180 450 120 1860 1200 795 
a,'a;* 10300 4500 6320 424 2800 750 1200 480 60 2960 1 
a,‘a,* 12012 | 15468 | 10620 7500 2532 3240 1596 348 10656 08 5022 
a,°a,* $9925 | 31221 | 37860 | 26640 | 19722 77 oa 3° 1422 180 | 25680 | 18075 | 12720 
a,*a;* 144200 | 79632 | 92680 | 66780 | 51128 | 222 23 14280 870 | 62160 00 2270 
a,a,"* pibsoo pris 226800 | 166950 | 131040 | 61740 | 63000 acne 15840 3420 151200 | 111300 1900 | 
a, 31 498 554400 | 415800 | 332640 | 166320 | 166320 | 110880 | 47520 | 11880 |3 277200 | 207900 t 
| 
' 











ore 

















eee 


SR e_ 


F. N. Davin anp M. G. KenpAti 


Table 2.12 (cont.) 


453 











w=12(v) | (532%) | (62%) | (4%31) | (5371) | (5421) | (6321) | (72"2) | (5*x*) | (641") | (731%) | (82x*) | (or*) | (4%) | (543) 
a: 36 12 36 36 72 72 36 18 36 36 36 12 —-4 —24 
4, —36 —12 —25 —25 —5° —50 —25 a | -1 —1 -1 —1I 4 24 
x94 4 8 —36 —36 —32 —32 4 —13 -2 —2 - -2 4 24 
G0 Q42 16 2 25 2 30 30 5 Z 14 14 4 I —4 —24 
a, Qs -9 —12 -9 1 —45 9 —9 =F —36 -9 -9 -3 ¢ -3 
Gy 42,8, 5 4+ 34 7 73 19 12 20 40 13 ir 3 ” —2i 
a,a,* | 2 —16 7 90), 33 —5 — —14 -5 —4 = 4 I 
45 —20 4 44 -—4 —8 —40 —20 -1 -4 -4 -4 —12 —12 - 
5432; 29 8 —34 = 31 33 18 25 18 15 15 4 8 II 
a@,a;* 4 . —4 20 $ 16 4 13 10 10 6 2 4 -8 
@,a,a;* —21 -6 9 —24 —39 —41 —17 —27 —22 —19 15 -4 -4 13 
a,a,;* 7 2 ¢ 7 14 12 5 q 6 5 é I x -7 
a,as 34 —12 s “ae 33 “37 = 17 —1 —1I -% —12 ¢ —23 
474, —14 8 —26 —6 -1I 55 —10 II 11 18 13 19 
7 Qa, —23 4 10 -17 -2' 30 12 —4 27 21 15 5 -8 14 
a,a;,a;" 4 -10 16 11 16 —49 —23 3 —25 —22 —19 -5 4 —22 
a,a;*a, 19 3 -9 8 18 —42 —16 2 —29 —23 -17 -§ 4 -6 
a,a,a;° -7 : —4 -8 53 22 —$ 29 24 19 5 . 7 
a,a,° ‘ -2 ; ‘ ‘ -12 -§ ‘ —6 —5 —4 -1 é ! 
a,* —-18 6 -—18 36 -18 9 18 —18 -18 -6 2 12 
4645, 2 . 26 5 —25 —27 26 —16 —2 15 50 13 -8 -13 
464,42 16 —12 -8 4 16 —24 16 -5 -1 30 10 14 8 —16 
G,4,a;° +4 2 I 2 7 5 —29 9 17 —25 —22 —14 —4 5 
a,a;* 9 . 9 -9 9 —27 9 : : 9 9 3 = 3 
443420, —11 8 —10 14 —-10 56 —42 I 5 —49 —41 —12 -4 
a,4;a,;° 3 c é -! —4 —II 28 -2 -3 27 23 6 F Z 
a,a;* —4 ¢ 4 - ° 8 —4 -1I 2 —10 —6 -2 -4 
@,a,*a;" 2 - : 4 3 —25 33 I “9 42 32 9 : =F 
a,a,a,* ‘ 2 . ‘ 7 —27 é I —29 —23 -6 ¢ 
a,a;* * x 2 5 : 5 4 I 
as? ae —< 2 I I os 17 —19 —% -4 -4 21 7 —4 II 
a;*a,? I -1 8 -5§ 10 I 2 7 9 3 —27 -7 6 I 
As Qya3 -11 8 -10 11 8 -—4 —6 I 5 —22 13 15 8 —14 
543424, 3 . 10 —10 —10 —10 18 10 7 16 —39 —jo -8 8 
a@,a,a;* -1I x ‘ 5 —2 2 -1I -7 -8 -5 26 15 s -§ 
53" a, 2 -8 . ; —10 14 8 -—5 2 II —24 -7 -4 Ir 
53a," 4 a  # 2 3 —11 19 I —2 1 +38 ]— +7 | =m 
@;@3a,a;" -2 8 ‘ -1 8 -11 —23 3 -3 —15 75 21 ; —2 
a;a;a,* < -—2 ° . 4 2 é 2 3 —27 -7 ‘ 
Gsa;* a, . —3 9 cag =<- 2 -2 23 7 3 
asa;*a,;* ° x ‘ ° . -3 15 é -1 4 —51 —14 ; ; 
@;a,a,° . 6 ‘ > i . -3 4 ‘ -I 27 7 ‘ ss 
asa," ‘ . : A ‘ ° . ‘ a . —4 -1 " a 
a? 4 —4 4+ -4 —8 8 4 6 —4 —4 -4 4 —4 8 
a2 asa, -1 4 -1I ‘ 10 —10 -9 -8 I 16 2 —16 4 -10 
aga: 2 2 -2 4 ° —4 = —4 8 -—4 -8 2 —4 
a,? a,a;" Z —4 ‘ -2 -1 11 4 4 -9 -6 —10 + ; 5 
a,*a;* 4 I > ° ‘ -3 -1 ‘ 3 2 I - . é 
4, a3" dz 3 ‘ I -2 —1 5 -5 4 -7 2 8 -8 —4 Ir 
a,a;*a;" 2 2 Z -2 -1 5 2 3 -9 -§ 12 ~ 1 
4,434," a, A 5 2 Z -3 5 -4 6 : —s5 24 . -3 
@4@34,a;* I 3 12 7 3 Z -5 . -3 8 II —32 7 . 
4,434; 55 16 30 20 10 5 I . ° -2 -1 8 . * 
a,a,* 12 ‘ 12 6 4 . = Z -2 2 -—2 2 . ‘ 
a,a;,a;* 39 6 I 18 Ir 3 > 2 Z -4 9 —16 : " 
a,a;*a,* 114 30 ) 53 32 14 2 6 4 Zz —6 20 : _ 
a@,a,a,° 315 108 210 150 5 51 13 20 15 6 I -8 é i 
a,a;* 840 336 560 420 280 168 56 7° 56 28 8 I ‘ ‘ 
a;‘ F ; 4 ° . ° . ‘ ° ° . . I -3 
@;° a,4; 12 18 7 3 3 Z 
a,*a;* 42 6 42 24 9 3 ‘ 4 7 » ‘ 6 3 
a;*a,* 31 ‘ 39 18 11 : ‘ 2 - . 3 6 3 
a;*a,* a," 96 15 96 54 30 8 2 4 2 1 ; 15 
a;*a,a;* 279 72 238 154 86 37 5 12 8 2 . 36 22 
a;*a,° 776 262 600 420 250 132 32 40 30 12 2 90 60 
3a,‘ a, 216 36 216 128 8 22 ‘ 20 9 > ‘ 36 22 
@,a,°a,° 606 159 546 357 22 93 12 54 34 7 5 . 93 60 
a,a;*a,° 1652 552 | 1390 976 635 320 72 160 115 40 ; . 240 165 
34,4, 4410 1722 3570 2632 1771 1 302 490 378 168 3 3 = 455 
3a, 11592 5040 9240 7056 4914 3024 1080 1512 1218 624 189 2 1680 1260 

¥ 480 90 480 300 210 60 ; 66 30 ° ° . go 60 
a,°a,;* 1320 360 1225 820 565 230 30 172 105 20 . ° 240 165 
a;‘a,* 3552 1206 132 2220, 1536 760 168 ) 336 108 12 ¢ 639 456 
a,3a,° 9417 3696 040 5952 | 4188 2337 681 1380 1041 432 8 6 1710 1266 
a,*a,* 24606 | 10752 | 20720 | 15848 po 2392 3164 1528 41 54 | 4620 3528 
a@,a,'° —_ 0240 | 53550 | 42000 0870 | 19740 7740 | 11592 9450 5040 1665 300 | 12600 9870 
a,"* 166320 | 3160 |138600 | 110880 3160 | 55440 | 23760 | 33264 | 27720 | 15840 | 5940 1320 | 34650 | 27720 
























































Tables of symmetric functions 


Table 2.12 (cont.) 





(741) 


1 
(10, 2) 














(732) | (82*) | (651) (831) | (921) |(10, 1%)} (6) | (75) | (84) | (93) (11,1)| (12) 
a3 —i2) ~12 | —24 | —24| -12| -24| -24] -24| -24] —122 6 12 12 12 12 12 | —12 
44, 12 12 24 24 12 13 13 13 13 I —-6 |) -12|] -12] -12] -12 -1 12 
Ayo Gy 12 2 4 4 -8 24 24 24 4 2 —-6/) -12| -12] -12 8} -12 12 
aoa," 12 -7| -14!] -14 —2] -13 | -13 | —13 -3 -1 6 12 12 12 2 1}| -12 
Gy, —15 12 24 -3 12 24 24 -3 - 12 -6| -12/] -12 1s | -12| —12 12 
Oy 0,2 3) -14] - —I) -4] -37| -37] -10] - —3 12 24 -3 4 13 | -24 
Qa; 3 7 1 5 2 13 I { 3 I -6! -12] -12 -3 -2 -1 12 
aa 12 12 - 24 ~ 24 = - 24 12 -6| -12 zo | -12] —12| —12 12 
@_43, 3 | -24| -16/ —ar - -37 - -2| -10| —13 12 24 -8 -3 24 13 | —24 
aa," —12 -2 12 —4 —24 - -8 -4 -2 6 12 —4 12 - 12 | -—12 
@,a,a,;" 9 21 10 15 6 50 18 15 Ir 4| -18 | -—36 —4 -9 -6/] —11%4 36 
@,a;* —3 -7 -6 -5 -2| -13 -5§ —-4 -3 1 6 12 4 3 2 1] —12 
G7 Qs 12 | —23 24} —11 12} ~—11 | —11 24 24 12 -6 23} -12] -—12]| -12] -—12 12 
@,4,a, —24 Ir} —16/ —1 -8 -2 2 -§ | —-37| —13 12} -11 -8 24 24 13 | —24 
@, 30; 3 21 | —28 - —4| -13 | -13 | -21 -1 | -™% iz2| -11 24 -3 4 24) —2 
@,a3a;* 9 -4 go 21 10 15 11 I 13 14| -18 -1 -4 -9}| -2 -14 3 
Gy a," ay 9| -19 16 12 ¢ 26 24 I 12 5} —18 —1 | -—20 -9 ¢ —25 36 
@; 43a; -12 7 ie —20 - —28 | -24]| -19] —1%4 -§ 24 3 16 12 15 | —48 
@,a,* 3 : 5 2 6 5 4 3 1 - —-s| -44 “3 -2 -1 12 
8 —12 6| —24 12 6) -24 12 12 12 6 15 -6 -6 -6 -6 -6 6 
45a, 12 II 24 -1 —24 4° -2] -37| -3 —13 | -24] -—15 % 24 24 13 | —24 
240, a, 12} —14 2 —2 12 -16/} —16| —2! —14| - 24 - 24 4 24 | —24 
@,a,a,;* - —-4| -18 27 10 | —27 Ir 18 40 14 I -1 -4 | -36|] -—2 —14 36 
a,a;" 6| -—12 12 3| -12 12| —24 3 3| -12| -—12 12 12| -15 12 12] —12 
@4430,0; sr | 17| —24 30 16 | —27 55 33 19 30 36 | -37 | —40 9] -32] —s50 72 
4,43a,° -3 4| -26] -12 14| -24]| -19 | -—16| —15 | -—t2 13 16 12 28 15 | —48 
a,a,* : 2| -12 4 ‘ é 5 8 4 2 6| -12 4| 12 8] -12 12 
aya," a;* 9 -2 22 | -—27| -10 2] -42] -33| -—23 -9| —18 37 24 18 2 39 | -72 
44,4; ~% ° -6 25 10 -I 29 23 17 6 6] -25| -—20] -—15/ -10| -—16 60 
a,a,° * ; -§ -2 . -5 -4 -3 -1 g 5 4 3 2 1] —12 
as" a, —12 3] -14 21 -2 II 11 | —24] —14 -7 6| -23 12 12 2 12] -12 
a,* a," ‘ -7| -s| -4 13 | —16] -10 25 20 7 9 17] -18 | -18] -13| -7 18 
@54,a, 3 1r| —16 I ~ -13 19 11 | —21} —24 12| -1r -8 -3 24 24 | —24 
44,430; 9} -3 16] -2 -25 | -19 31 73 30 : 33 | -8| -45] -32] -—s5o 72 
@,a,a, 5 7 + 3 | -32 19 II —22 |) -43 | ~15| -12/ —22 16 3 28 15 | —48 
ee -9 I é -17 20 5 -6 -1 7 25 : -1 -4 I -—36 | -—25 36 
@,a;a; 9| -4 1 —23 4 2] -14 29 5 16 | —18 34 | -20| -9 4] -36 36 
@5 aa, a," 9| -4| -16 39 | -—32 7 21] -63 | -43 | —48 - | —33 48 9 64 78 “398 
@,a,a,* -3 . 2 -4 14 -6 -6 23 19 16 6 10} -—20/ -15| -30| —16 
a,a,' a, -9 2 -4 19 -4 ca | 3| -26/] -16 -7 12 | —22 16 21] —12 37 | -48 
a,a,*a,* 3 é 2) -23 18 I -4 52 37 14 -6 20 | —40| -30] —10/] —54 120 
454;a; ‘ ‘ Te 5 | —12 4 1| -—27 | -20 -7 ‘ -5 24 18 12 17| -72 
@,a," e . ‘ > 2 ‘ $ 3 I pe ‘ -4 -3 -2 -1! 12 
a? -4| -4 8 -8 4 -8 8 - -4 2 4| —12 + é 4 = 
a,'a;a,; 9 I -8 1° -4 26 | -26] —34 3 25 —18 -1 44 ~ =~ a 3 
*a,* ° 8 | -12 16 | -1 ; -4 I 9| -18 22} -1 2) -1 18 
a,' a,a,* -9 -7 8 -7 1 I 15 I —62 er ; 2] -40 45 32 3 -72 
a,*a;* 3 : 3 I 384 -—31 =$il'=3 23 3 5 6] -2r/ -15} - 30 
44," a, ne -9 5 P I s| -—14 -2 26 c -1 —4 18 | -16| —36 36 
@,4,*a;" . 3 2] -4] -6] -20 15 34 | —20] —39 18 2| -40/ -9 62 39 | -72 
@,a,a,*a 9 6 -8! -19 24 -1 3 16 | -—36| —51 . 4| —16 9 24 111 |—144 
4;0,a; -3 ° 4 7| -12 7) -1, | -27 73 68 | -12 5 48 | -—33 |—100 |-108 | 240 
@,a;,a,* é 2 : -1 ‘. 3 5 §| -22| -17 ¢ -5§ -8 I 32 17| -72 
a,a,* ° -2 4 “4 4 ‘ ‘: . —4 -2 -6 12] -—12 12 -8 12} —12 
a,a,5a;* ° ° -2 -1 -1 3 -5 39 16 6| -15 241 -39 10 | —76 120 
a,a,*a;* ° ‘ —2 12 4 -1 5 | —54] —20 : 5 | —20 45 20 72 |-—180 
4,0,a,; F ° “ —2 P ‘ -I 23 8 ° 4{ —-21 | -14/] -—18 84 
aa," ° ‘ ° é ¢ i . -3 -1 ‘ ‘ 3 2 1] —12 
a,* 3 3 7% =3 3 -3 6|- -3 -3 3 3 +9 —- 6; - —_ a 
a," a, a, -3 -3 4 7| -12 7| -18 13 19 | —27]| -12 13 16 | —42 2 37 | - 
a;*a,* Z P —4 -1 6 2 3) —1 -6 18 -2 -5 8 17 | -—30| —18 40 
a;*a,* e Z —2 2 —2 -1 3 -5 9 -9 6; -11 8 -3 -6 24 | -24 
@;*a,*a,* 2 2 7) -4 2 —4 9| -5| —30 54 9| -s| —28 $3 —45 |-114| 180 
@,*a,a;* 9 6 4 Z - ‘ —3 10 13 | —45 : s| -4|- 7° 7° |—180 
a,*a,* go 20 15 6 zs pe 7 -2 -1 9 Pe é 2 3 -17 -9 2 
@, a, a, 6 9 4 . . Z as S| ‘#9 9 -6 1wo| -4] - 20 | —49 
@,a,'a;* 24 24 15 3 ° 3 I -5 14 | -30 é -5 16 | —a1 e 130 | —240 
a,a,*a,* 81 7° 17 2 10 5 Z -7 27 F 3 -4 18 | -—32| -—87 252 
3 4,a; 252 210 161 7° 15 3 21 7 Zz -9 ‘ ° -3 16 19 | —96 
a,a;° 756 630 504 252 72 12 84 36 9 Z ° Ps ° ° -2 -1 12 
6 20 ° 15 ° . 6 ° ° . ° Zz -2 2 -2 2 -32 2 
a,'a;* 70 I 50 10 . 17 5 a . ‘ 2 Zz —4 9] -—16 25 | —36 
a,‘a,* 228 228 160 52 6 $2 24 4 é 6 4 Zz -6 20} —50 105 
a,'a;* 7 498 207 42 165 9 2 3 é 20 1 6 Z -8 35 |—112 
a,*a;? ose 1932 | 1526 736 ges 532 33 12! 26 2 7° 5 28 8 r| -10 54 
Qa; 300 © | 4620 | 24) 10 | 1722 | 1170 | 525 145 2 252} 210] 120 45 10 zr] -12 
a," 18480 18632 13860 | 7920 | 2970 | 5544 | 3 1980 660 132 | 924| 792] 495 | 220 66 12 Z 
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Table 3.10 





w =10 (i) 
















































































Ayo a, a, eG, | aa; Qy@y | G7G,Q, | G,Q,° | Gedy | GeQya, | GeQy* | ayQza;*| aeay* ae | aaa; 
hip a 2 2 -3 2 -6 4 2 —-6 - 12 = I —6 
Ighy I ps —2 “ -2 3 : -2 “ 6 me : -2 
Iighs I -1I * —2 2 * ; -2 5 -3 * ° 
hgh; —! ‘ ° 2 ° ‘ ° 2 —3 . . 
hyhs I -2 I Hi -2 " 4 -2 ‘ ; 
hjhyhy, -1 1 : ; 2 -2 i . 
ph I > : . ‘ -2 . « 
I -2 -1I -1I . —2 
highahy -1 ; : -1 x ‘ 
hgh, -1 2 -1 : : 
Aghgh;* I -1 : - 
hele -1 > é 
s I ~- 

hghyhy po 

w=10(ii)| GsGpaq | asaya," | apa,*a; | asa,a;*| asa, | aga, | aray® | ayas* | ayasQr0;| ayaza,;* | agar? | aya*a,*| aa,a,*| aa,* 
hip -6 12 12 —20 6 -3 6 -3 —20 4 —jo 30 -7 
hyhy ° 6 3 —12 5 ‘ 3 ° %4 —12 > —12 20 -6 
highs -2 2 6 —10 4 -1I I ‘ 6 -6 3 —15 17 -5 
hg hy é 2 ‘ -6 4 ; I RN M -6 . -3 12 -5 
hyhs —2 3 4 -8 3 ‘ . -2 10 -6 : —12 14 -4 
hyhghy ° ‘ 2 -5 3 : x 2 -—2 é -6 10 -4 
hh? é ‘ -2 3 ° r ‘ ° -2 P ° 6 —4 
hgh, 4 2 —6 2 -2 3 -1 10 -10 3 -15 16 —-4 
hghghy 2 -4 2 ° : . 2 -3 ‘ —-4 8 -3 
hgh;* ’ 2 -4 2 . . 2 -7 8 -3 

hhh? ° : é -2 2 . ‘ m é ‘ -2 5 - 
hahy* 3 , : : 2 3 i . ‘ ‘ : 2 se 
h,? —4 6 6 -8 2 ° 4 8 —12 ‘ —12 16 —4 
hshyhy ° 2 I -3 I ° 2 2 -7 ° —5 10 —3 
hshghy -1 I 2 -3 I ; 2 -2 -4 6 -2 
Isha I ‘ -2 I ‘ : ‘ -2 = 4 -2 
hsh,* A 1 -2 I . é e ‘ -2 4 —2 
Isha, -1 I ¢ ° : ‘ ° ° 2 —2 
hsh,® I ° ‘ , ° ° . : —2 
heh, —1I I 4 —4 2 -8 8 -2 
h2h,* I . -4 A -2 6 -2 
hyhs* -1 4 -2 Py —4 4 -1 
hhghahy I -1 ° -2 3 -! 
hee —1I : . 2 -1 
4 I —3 3 == 
hhh? -1I 2 -1 
hgh, hy* I =% 
hh! -1I 
w= no(iii)} as*@, | a@s%aq* | ay*a_a,*| as2a,* | @3a,%a; | asas*a,*) a,a,a;*| asa,” as® | asta;* | aPa;* | a,*a,* | aa," | a," 
hyo 4 6 —30 15 —20 60 —42 8 -1 15 —35 28 * I 
hyhy I ° -12 10 -4 30 —30 7 ° 5 —20 21 - I 
highs ° 3 -9 6 —12 32 —26 6 -1I Ir —25 22 -8 I 
hgh ° ° 6 . 12 —20 6 ° I —1I0 15 - I 
hyhs 3 3 -1 8 —10 37 —28 6 ‘ 8 —24 22 - I 
hyhgh, . : -3 3 —J 15 —27 5 . 4 —14 16 r? I 
hy,h? . ° a 3 . 3 —12 ; ; —4 10 -6 I 
Ighy 2 I -1 9 -8 3 —28 -1 —24 22 -8 I 
Kghsh, I ‘ - 5 -1 I -19 5 ‘ 2 —f2 16 -7 I 
hg h,* ° I -2 I -6 16 —14 4 -1 8 -1 17 -7 I 
hehyh,* : -1 1 : 6 —10 4 , I “yy II —6 1 
hgh,* ° p 1 x ° -6 4 ‘ ° -1I 6 - I 
h,* 4 —12 9 -12 34 —28 6 ¥ 9 —24 22 - I 
Ashghy ‘ —4 6 -2 15 —19 5 x 3 —13 16 ag! I 
hshghs 2 -5 3 -7 19 —16 4 6 -17 17 -7 1 
haha} @ -2 3 e 7 —12 4 @ ° -6 Ir -6 I 
sh, A ° ‘ “ -2 7 -8 3 rn 3 —10 12 -6 I 
hshyh, . ‘ s 2 -§ 3 ° ° —3 7 =§ I 
hgh, e ° ° A -2 3 ° . ° 3 -4 1 
hehe -4 4 -4 16 —16 4 -1 7 | -17 17 -7 I 
hf@h,* P - . 4 ‘ 4 —12 4 = I -6 Ir -6 I 
heh 2 I —1r 5 —4 22 —18 4 : 4 —16 17 an I 
hyhghah, : is -2 2 1 8 —r10 3 f 2 -~9 12 - I 
hyhyh;? . . ‘ 2 . I -7 3 x ‘ -2 7 ~ I 
hgh, ° . : : -2 6 -6 2 -1 6 -13 " - I 
hhh? . . : ‘ Z 2 —4 2 ° I -5§ =~g I 
hghahy ° ‘ ‘ ‘ bs e -2 2 ° -1 4 —4 1 
yy : ; : ’ ‘ é 2 . ; I - I 
1 I . ~ 3 . 12 —12 3 ° - 12 - I 
h,*h,* I -2 I -4 10 -8 2 4| -12 13 -6 I 
ha*hyh;* -1 I ‘ 4 -6 2 ‘ - -4 8 -5 I 
ha*hy* I ‘ 4 —4|° 3 e ‘ . 4 -4 I 
hh? hy ~% 3 -3 I . 2 me 9] -s I 
hsha*h? I -2 1 ‘ ‘ —2 5 —4 1 
Aghyh, -1I I . ‘ ; 2 =e I 
Re I 2 ; . -2 1 
—3 5 —10 10 -§$ 1 
hyth,* I 4 6) -4 1 
hy*h,* =e a =p I 
hyth,! I -2 I 
s 

me a 
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Tables of symmetric functions 


Table 3.12 (cont.) 
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Moment constants for the distribution of range in normal samples 


By H. O. HARTLEY anp E. 8S. PEARSON 


Given a random sample of n observations 2}, ..., 2, arranged in ascending order of magnitude, drawn 
from a normal population with unit variance, we 2enote the range by w = x,—2, and its probability 
density distribution by f,(w). 

Tables of the probability integral of range 


w 
P,(W) = Salo), 


as well as of its percentage points, are available (see, for example, Pearson & Hartley, 1942). However, 
the numerous applications of this statistic frequently lead to distributions derived from that of range. 
For example, the mean of k independent ranges is often used for the estimation of the parental standard 
deviation o. The distributions of such derived statistics could be obtained from the table of P,(W) by 
numerical processes of quadrature (see, for example, Lord, 1947). In order to avoid the labour involved 
in such procedures it is often possible to use analytic approximations to f,,(w) (Cox, 1949; Florin, 1950; 
Patnaik, 1950) which permit the analytic evaluation of the distribution of the derived statistics. In order 
to facilitate the choice of such approximations and to provide checks on their accuracy it was felt that 


a systematic tabulation of the moments of f,,(w) would be helpful, apart from the independent value of 
putting them on record.t 



































4 o He B; Bz Hs ba Hs Hs Ks, Ks Ke 
1-12838 | 0-8525 | 0-72676 | 0-9906 | 3-869 | 0-6166 | 2-044 | 4-46 | 13:3 | 0-459 | —0-03 | —1-22 
1-69257 | 0-8884 | 0-78922 | 0-4176 | 3-286 | 0-4531 | 2-047 | 3-44 | 11-1 | 0-178 | —013 | —0-41 
2-05875 | 0-8798 | 0-77407 | 0-2735 | 3-188 | 0-3561 | 1-910| 270 | 94 | 0113 | —0-06 | —0-15 
2-32593 | 0-8641 | 0-74661 | 0-2174 | 3-169 | 0-3008 | 1-766 | 2-22 | 81 | 0094 | —o02 | —0-07 
2-53441 | 0-8480 | 0-71916 | 0-1892 | 3-168 | 0-2653 | 1-688 | 1-91. | 7-2 | 0-087 | 40-00 | —0-05 
2-70436 | 0-8332 | 0-69424 | 0-1742 | 3-174 | 0-2415 | 1-530} 1-69 | 64 | 0-084 | +002 | —0-03 
2-84720 | 0-8198 | 0-67213 | 0-1657 | 3-184 | 0-2243 | 1-438 | 1-53 | 5-9 | 0-083 | +002 | —0-02 
297003 | 0-8078 | 0-65262 | 0-1608 | 3-191 | 0-2114 | 1-359 | 1-41 | 5-4 | 0-081 | +003 | —0-02 
3-07751 | 0-7971 | 0-63531 | 0-1580 | 3-200 | 0-2013 | 1-292} 1-31 | 5-0 | 0-081 | +003 | —0-00 
3-17287 | 0-7873 | 0-61984 | 0-1564 | 3-205 | 0-1930 | 1-231 | 1-23 | 47 | 0-079 | +003 | —0-01 
3-25846 | 0-7785 | 0-60601 | 0-1560 | 3-213 | 0-1863 | 1-180| 1-16 | 44 | 0-078 | +004 | —0-00 
3-33598 | 0-7704 | 0:59353 | 0-1559 | 3-220 | 0-1806 | 1-134 
3-40676 | 0-7630 | 0:58217 | 0-1561 | 3-225 | 0-1755 | 1-093 

Definitions 

3-47183 | 0-7562 | 0-57186 | 0-1568 | 3-231 | 0-1712 | 1-057 a re 
3-53198 | 0-7499 | 0-56237 | 0-1576 | 3-237 | 0-1674 | 1-024 pe f eatieah ites ial an i) w— pt)? f.(w)dw; 
3.58788 | 0.7441 | 0-55363 | o-1588 | 3.242 | 0-1641 | 0-994 | “t= Jq W/a() m5 Me = J, (w— Mi)” Sal) 
3-64006 | 0-7386 | 0:54554 | 0-1598 | 3-248 | 01611 | 0967 | = Juss B, = 8], By = melds 
3-68896 | 0-7335 | 053802 | 0-1612 | 3-254 | 0-1585 | 0-942 Vilas Ba = wb lads Ba = malt 

Kg = Hg — 31, Ks = Hy — 10g Ha, 
3-73495 | 0-7287 | 0-53097 | 0-1627 | 3-259 | 01561 | 0-919 | eg = 4g — 15 p44 4g ~ 103 + 30 p23. 
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The table above was derived from the 5-decimal manuscript of the Pearson-Hartley table by numerical 
quadrature. Its accuracy and scope is limited by the fact that the higher moments require an increasing 
accuracy in the tails of f,(w). From this point of view it was found most convenient to introduce suitably 
chosen ‘working means’ w*, depending on n, and to evaluate the moments yf about them by applying 
quadrature to the formula 


m 
= (-wpy +f (w—wy)*(1—P,,(w)) dw. 
0 ° 
+ Certain results up to n = 12 have already been tabulated by Grubbs & Weaver (1947). We note 


certain discrepancies between their results and the present table. The method of computation is not given 
in their paper. 
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From these moments, the moment-constants defined in the table were computed. The values of y} 
are exact; the remaining values may be in error in the last figure quoted. 

For expert help in the computational work we are indebted to Mr T. Vickers of the Mathematics 
Division of the National Physical Laboratories, while the moments about the mean and the cumulants 
were calculated from the u* by Miss E. R. Fitch. 
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Rapid methods for estimating correlation coefficients 


By L. N. CHOWN anp P. A. P. MORAN 
Balliol College and the Institute of Statistics, Oxford University 


If 6, is a consistent estimator of a po ulation parameter 6 and §, is a most efficient estimator of 0, the 
efficiency of 6, i is defined to be var (9,)/var (8,). In practice, however, the times required to calculate 
6, and 6, may be very different, and from the point of view of the cost of computation, especially if an 
indefinitely large amount of data is available, it may be better to assess the efficiency by multiplying the 
above quantity by ¢,/t,, where ¢, is the time or labour required to calculate 6,, and ¢, the time or labour 
required to calculate 6,. In the present paper we consider two cases of this kind. In the first part we 
describe what appears to be the most rapid method of estimating a product-moment correlation coefficient 
yet invented, and in the second part we consider a method of estimating the first-order serial correlation 
coefficient in a sample of a time series generated by a simple Markov scheme. 

Suppose we have a sample {z,,y,}(i = 1,...,n) from a bivariate normal distribution with unknown 
standard deviations and correlation coefficient p. The sample product-moment correlation coefficient 
ris an asymptotically most efficient estimator of p but requires a fair amount of computation. Kendall’s 


rank-correlation coefficient ¢ is easier to calculate for small n, and ; sin“ is an estimator of p which is 


consistent and has an efficiency 97-* in large samples for values of p near zero. However, for large n, it is 
probably more complicated to calculate than r. It is, however, possible to find another correlation 
coefficient, of ranking type, which is extremely easy to calculate and has an efficiency, when used as an 
.estimator of p, for p near zero, of about 0-33. 

Write sgn z = + 1 as x2 0 and sgn 0 = 0, and define 


1 - 
es "S a,enbuean (1) 


where a, = sgn(x,—2;), by = sgn(y,—Yy;). 
g depends on the order in which the pairs (x,,y,) are given, but if this is random, 
E(g) = E(a,, 64155 441). 


But z,—2,,,; and y;—Y;4,; are jointly distributed in a normal distribution with correlation coefficient 
p and so (compare Kendall, 1948, p. 114) 


2. 
E(g) = EO, 6415s, 441) = sin p- (2) 
As an estimator of p we could therefore take r, = sin }7g. From (1) we also have 


n—-1 n—2 
(n—1)* E(g*) = £ (= 64108 ta + 22 6, 14186, 641% e41, 1420441, 1424 ae Dy, 54104, 6420s, 94105, ra} (3) 


li—j|>1 














ee 
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Clearly E(a?,,,) = £(63,,,;) = 1, and in his investigation’ of the variance of the rank-correlation 
coefficient ¢, Kendall shows (1948, p. 115) that 


RSS ae 
EG, 41.05, 641 Ces, 420641, 242) = acai «: Beit, *~ sal tp +> 
If | i~j|> 1 we also have 
2. 3 
1, 54185, 64105, 54155, 541) = (EAs, 64104, cn)? = (=sin-*p) . 


Substituting in (3) we have, after a little reduction and the subtraction of {H(g)}*, 
1 2. 2] An-2)fT1 (2. s 
= —— ] 1-[sin- <— {= sin-1 
var (g) = o. [ (sin r) |+ (n—1)? | 9 (Fin iv) [; (4) 


lIn-13., ll 
9(n—1)2 ~ 9(n—1)° 








Thus if p = 0, var (g) = 
Since | sin-14p | >| 4sin-"p|, we have 
se pin | 2 . 
<— [= sin-1 <1y—(“sin-1 = 1/1 — 9), 
; (Fain ie) <5(! (sin r) | = $(1—g") 
Hence we have the approximate inequality 
lln—13 
9(n — 1)? 


In the neighbourhood of p = 0 we have ér, = 47dg and so 


117? 
var (r,) = bntvar gs" 


var (9) < (1-9). 


Then since r and r, are consistent estimators of p, we see that the efficiency of r, as an estimator of p, is, 
in the neighbourhood of p = 0, wet 36 


var(r,) 117? 








= 0-33, 


and thus the use of r, involves throwing away about two-thirds of the information. Nevertheless, if the 
pairs of observations are in random order, the labour of calculating r, from 3n pairs of observations is very 
much less than that of calculating r from n pairs. 

Cochran (1937) describes another rapid method of estimating p.* If % and ¥ are the means of the 2’s 
and y’s he finds the proportion of cases in which sgn (x, —%) sgn (y; —Y) > 0, and equates this to 


$+7-—sin— p. 


He shows that in the neighbourhood of p = 0, the efficiency of this method, when used as an estimator 
of p, is 6-405. However, the present method appears to be much faster and therefore preferable in cases. 
where the pairs of values are given in random order. 

Now suppose that one of the variates, y say, is given in a prescribed order. We can still test whether the 
x’s are independent by showing that in the universe of n! equiprobable permutations of the 2’s, g has 
expectation zero and a variance depending on the y’s but with a usable upper bound. For in such a case 
we obviously have 

E(4, 441) = 9, 


Ey, 641% 41, 142) = — 4, 
and so E(g) = 0, 


Qn-2 
(n—1)* var(g) = (n—1) -3 2, Be 641 Op4s, 42° 


The sum on the right-hand side could be calculated in any given case. However, we shall always have 





n—-2 ; 
2 b;, t+1 Bess, #+2 





< (n—2), 


* [This method has sometimes been described as ‘Sheppard’s method’ of estimating a correlation 
coefficient; it follows from the work in his 1899 paper, Phil. Trans. A, 192, 101. Ep.] 
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and so (n—1)* var(g)<(n—1)+3(n—2) = 4(5n—7), 
5n—7 
and thus . var (g) < 3(n— 1)” (5) 


which will often provide a reasonable test. 

The exact distribution of g for small n and all permutations of x equiprobable has not been found either 
for a fixed or a random order of the y’s. Such a distribution would be interesting but probably not of 
much practical value, for if n is small the loss of information is so large that the present method is scarcely 
worth while, and it would be better to use a product-moment or a ranking correlation coefficient. 

A very similar type of method may be used for the estimation of serial correlations. Consider astationary 
time series generated by a relationship of the form 


(X44 —™) = p(x,—M) +&, (6) 


where m is unknown and {€,} is a sequence of independent random variables which are normally dis- 
tributed with zero means and the same, but unknown, standard deviations. Write 


X, = sgn (x,4,— 2%) Sgn (%j42—Xj4,) (t= 1,...,.n—2). 


Put S=— 2x xX, (7) 
n—2 4-1 
Then if p = 0, we have E(X,) =-}, (8) 
n—-2 2 
and E(S?) = (n—2)"°# ( z x,) 
1 


n—-2 n-3 n-4 
‘i (n—2)-*B| EX242'D X,Xi+2D X,Xuet LE X,X,). 
1 1 1 


\i-j|>2 
Now E(X?) = 1, and if |i—j| >2, E(X,X,) = {E(X,)}* = }. 


By enumerating all of the possible 4! and 5! permutations respectively it may be shown that E(X,X;,,) = 0 
and E(X; X i+2) — is. Then 


E(S?) = (n—2)-*{(n— 2) + x45(n — 4) + §(m — 4) (n — 5)} 


ie 5n? + 12n — 38 
~ 45(n—2)2 ’ 
32n—58 32 


and var (S) = 45(n—2)* * 45m" (9) 
The calculation of S is simple and quick, and if n is reasonably large provides us with a useful rough test 
of the serial independence of the sequence (2, ...,%,). It may be noticed that this method is related to the 
calculation of Kendall (1946, p. 382) of the expected number of peaks in a stationary time series, the 
difference being that we here use both peaks and ‘troughs’, and also consider the variance. 

We now consider what happens when S is used to provide an estimator of p in a process generated by 
a relation of type (6), with p+0. Consider any sequence (2,23, x3). Then (2,—2,) and (x,—2,) are easily 
seen to be jointly normally distributed with a correlation coefficient —4(1—). It follows that 


pr. {v,—2, >0, 73-2, > 0} = pr. {a,—27, <0, 73-27, <0} 


1 1 p-l 
= — aus tomas -1 
=3(1 = 008 ( F )}. 


Thus E(S) = E(X, = 1—= cos (5 *). (10) 








We therefore take as our estimator of p, Pp = 1+2sin 47S. 


We have not been able to find the variance of S when p +0, but as it is clearly a continuous function of p 
in the neighbourhood of p = 0 we can find the efficiency of f as an estimator of p relative to a maximum- 
likelihood estimator when pis small. / is clearly a consistent estimator, and when p is near zero we can write 


6p = mcos 4nS8S, 





ee SS OOOO 
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so that var (() + 7° cos? 47 var (S) = val . 
15n 
On the other hand, the maximum-likelihood estimator of p is also consistent and has, near p = 0, a variance 
equal to * +0 (<) . Consequently, the relative efficiency of / is 


15 
— =0-19. 
877? 


This method of estimating p is not only rapid but gives us an unbiased estimate of a function of p, and 
is independent of m and the variance of €,. It may therefore be useful if we have to estimate p from a large 
number of short series. To do this we add together the values of S, each of which is an unbiased estimator 
of the right-hand side of equation (10). We are indebted to Mr E. J. Williams of the Australian C.S.I.R.0. 
for drawing our attention to the need for a test of this kind. Although we have not found the variance of 
S when p+0, if we are estimating p from a number of short series we can find an estimate of var (S) 
from the observed variance between the values obtained. 


One of us (L. N.C.) is indebted to the Department of Scientific and Industrial Research for a maintenance 
grant. 
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We are indebted to Mr D. E. Barton of University College, London for the following notes: 

The moments of S as defined in equation (7) may be derived for second and higher orders without 
recourse to the heavy labour involved in enumerating large numbers of permutations. For these moments 
are seen to be functions of terms such as E(X, X,,... X;,) of a simple form, and these expectations may 
be evaluated in the following manner. 

We take i, <i,<... <7, and find that: 


(a) If |¢,—7,,,|>2 forr = 1,...,n—1, then 
E(X,, Xi, »++ Xi) = {E(X,)}". 
(b) If (21, %9,..-,7,) contains a group (7,,%,41, +++»%,44) With the following properties, 
(i) %4941—%4~ = 1 for all S = 0,1,...,¢—1, 
(ii) neither 7,44,; —7%,4, = 1 nor 7,—7,_, = 1, 
(iii) either 4,.4,, —%)4,> 2 or 7,—%,_, > 2, 
then E(X,, Xj, -.- Xi,) — 0. 
(c) If neither (a) nor (6) are true, then E(.X, X,, ... X;,) is a product of such terms as 
Proaa = E{sgn (2441 — 2) 88M (p49 — 241) «+» 88 (Xi4m—Li+m—1)}+ 


Further, P,, = @m41,;/m!, where @_41,; is shown simply to be determined by the recurrence relations 


where a,,, = l. 
Thus P,;=-}, P, = +s, P, = -—#i5, P, = x84, 


and, from the symmetry of the recurrence, it is seen that P,, = 0 for even values of m. 
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Mean and variance of an entry in a contingency table 


By W. L. STEVENS 
Faculdade de Ciéncias Econémicas e Administrativas, Universidade de Séo Paulo 


Patnaik (1948), in a paper which examines the power function of the test for independence in a 2 x 2 table, 
derives certain formulae for the mean and variance of a tabular entry which may, under other 
circumstances, prove to be misleading. 

Using Patnail’s notation, we may represent the contingency table by 





acim 
b din 
rs|N 


If m and n are arbitrarily chosen, if the probabilities are equal that an event classified in either the first 
or second row will fall in the first column, and if we consider only those results for which r and s have given 
values, then it can be shown that the mean and variance of a are 


p=mr|N, (1) 

o? = mnrs/{N*N —1)}. (2) 

When the table is not ‘independent’, let us designate these probabilities and their complements 
respectively by “_ awiek. 
P» &@=1—-Py 

and write V = P192/(P2%) +1. (3) 

Then it can be shown that the probabilities in the distribution of a are proportional to 
Initriot 
Nratbietai¥™ () 


Patnaik seeks the mean and variance of this distribution. Following him, we replace the first factor 
of (4) by the ordinate of the normal distribution with mean and variance* given by (1) and (2) and 
write the second factor as exp (alog y). We deduce that the distribution of a is given by 





1 (a—p—a* log y)? 
o \(2m)~ ~ 20% } ” 
whence +e conclude that mean (a) = 4+o*logy, (6) 
variance (a) = a. (7) 


It is difficult to detect any flaw in this argument, but that the result is erroneous can readily be shown. 
Let us suppose, for example, that p, = 0-02 and p, = 0-98. Suppose further that m =n =r=s= 4N. 
It will be noted that we are not endeavouring to obtain a peculiar result by imposing unnatural restrictions 
on r and s. On the contrary, the values chosen are the most probable. 

Now suppose that N and hence m tend to infinity. It is evident that, notwithstanding the restriction 
on r and s, the proportion a/m will tend, in the probability sense, to p, = 0-02. Hence a/N tends to 0-01 
and the formula for the mean should therefore give 


mean (a) = 0-01N. (8) 
Using Patnaik’s formula, we find, when N is large, 
f= 0-25N, ao? = 0-0625N, 
¥=(d5)?, logy = —7-7836 (9) 
mean (a) = minus 0-2365N. 


Apart from being wrong, the result is evidently absurd—the mean of positive values cannot be negative. 


* It is open to question whether there is any advantage in using N*(N—1) as the denominator of 
expression (2) instead of the simpler N°. 
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We must therefore re-examine the argument by which these formulae were obtained. It is observed 
that, keeping the marginal proportions fixed and allowing N to tend to oo, the variance given in (2) also 
tends to oo. Hence, unless % = 1, the term o* logy in equation (5) will tend to +o. Consequently, in 
order to obtain finite values for (a — 4 — o* log ys), we must suppose that | a — | -> 00. This implies that the 
pertinent values of 

minirts! 

‘ Niatbictd! 
are those in which the values of a are increasingly remote from the mean. Now, although it is legitimate 
to replace the bulk of the distribution (10) by a normal distribution with the same mean and variance, it 
is quite wrong to suppose that this approximation will be valid in an increasingly remote portion of a tail 
of the distribution. 

It appears then that, for Patnaik’s derivation to be valid, we must suppose, not only that N + oo, but 
also that > 1. Since he is studying the power function of the test for independence, i.e. the test of the 
hypothesis that y = 1, the use of the formulae, in this particular contezt, is entirely walid. In other words, 
(6) and (7) are valid approximations in the neighbourhood of y = 1. As, however, this limitation does not 
appear in his derivation, the formulae may yet prove a trap for someone who uses them for other values 
of vy. 

We give below, without proof*, the formulae for the mean and variance of a. Let four quantities, 
74, 7g, 7, and 7,4, be defined by the relations 


(10) 


1, +,.+173+7, = 1, 
m+7,=7/N, 


(11) 
7, +73 = m/N, 
7, 1,4/,7, = Y. 
Then, in the limit, mean (a) = Nm, 
variance (a) = (12) 


1. did wel 


The variance is seen to be a natural extension of the variance of a binomial, when the latter is written in 
the form 





To return to the example examined earlier, we have 7, = 7, = 0-01 and 7, = 7, = 0°49. Hence 


mean (a) = 0-01N, 
as already given in equation (8), and 


variance (a) = 49N/9802 = 0-:005N approximately. 


it appears that the variance is about one-half of what it would be, if no restrictions were placed on the 
raarginal subtotals. 

To check Patnaik’s approximations, let us consider a less extreme example: 7, = 7, = 0-2, 7, = 7, = 0-3 
and all marginal subtotals equal, as before. Then, N being large, we have by the two methods: 








Proposed Patnaik 
Mean (a) 0-2N 0-19932N 
Variance (a) 0-06N 0-0625N 

















It appears that the results are in fair agreement in this instance, even although y( = #) is ‘a good way’ 
from unity. 


* See Kendall (1947), Vol. 1, p. 312, for a similar result. 
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In conclusion, we suggest that the problem of approximating to mean, variance and higher cumulants 
of an entry in a 2x 2 table with % +1, is one which merits further study; the ‘well-worn’ topic of the 
2x 2 table, far from being exhausted, is proving itself to be inexhaustible. 
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The Wishart distribution derived by solving simultaneous linear differential equations 
By H. 8. STEYN, University of Pretoria 


Let us first consider the multivariate normal probability function 
P= P(x, Ly, ---, Ly) = Coxp {— $(ay, 2} + 2ayg%, %y +... + ap, 23)}, (1) 


where the function inside the bracket is a positive definite quadratic form. On differentiation we have 
a s 
=> a6 = (44% + Og¢%g+ see $5, X,) D (a = a; Ruiz k). (2) 
‘ 


Now, if M=M (a, a, ...,@,) is the moment-generating function of (1) it is obvious from 


'cO io 9) @ 
ae = [ff enantio 
-o J —-@ -—2 


that a, M is the m.g.f. of -~$ 
P . (3) 
and = M is the m.g.f. of x,¢. 
‘ 
Thus, it follows from (2) that 
é é 7) P 
a,M = eg ting B+. tho we (4 = 1, By .<n<p8)- 
Solving the k equations in : o Mirae k), we have 
ving q Mea, me by Mg eeeg M)}y 
TM = fl Ane | y+ | Age | at + | Ag | eto +| Aca} 
Moa,  |Aji*u!% 2t| At... | ete. ik | Xph> 
Gy Aygq +++ Aye 
where the determinant |A|= Gyqg Ugg ++» Age 
Ait Aor eee One 
and | A,,| = | A,,| is the co-factor of a,, or a,,. Integrating we find 
1 l 
M = exp {5 ll Aulat+2| Aua|aita+..+|4su| ab), (4) 


after fixing the constant of integration. Similarly, by starting from (4) we may proceed to the probability 
function (1) by solving correspondingly k simultaneous equations. 

The same ideas may now be used to derive the Wishart distribution. Let 2,;,%9;,...,2,,(1 = 1, 2,...,.N) 
constitute a sample from a multivariate normal population with probability function (1). Let the joint 
m.g.f. of the $k(k+ 1) second-order statistics 


1 N N 2 
ales oe N-1 (2 22 (2%) /| 


1 (nN N WN \ 
and Veg OF Pug O 4s = wail 278" (= wy 2 ea) /1}, 





Se 
———— ——— ~ . 
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be given by 
oe @ o 
M (013, Xyq, ---5 Sex) ={ | | PiGs --- Py OXP (Oy, Vyy +O yQVigt -.. + Ope V px} Cay, days... deny, 
—@o —@ —-@ 
where P= (“yy Tq,---,%y), Andina, i<j. 
Now 


7] 
On 30, M+aq5— M+. +0u5—— M+. +045 M 


S i 1. of Pride +++ Py Misr + Dig Vjat --- +A gs jg +... +O yp Pp) 
—-oJ —o —2 
X OXP {01011 + ...} diy, 02,4... dxzy (where a,,=a,, and v,,=v,, and &,=,5) 


1 N Cs) © Cs) 
y-1, 3 ae: se $1 --- (i) --- Oy ‘hie PQ yt Dig lyyt ... + Ay %y_t... +0, 2) 


«(2n- | > Xy_/N ) OXP (Oy, yi +...) Uy | dary, ... (dazg) ... daryy.t 


Integrating the function inside the square benial by parts with respect to x,, and remembering that 
by equation (2) 





a 
PG %.+ QigXgyt.-.+ O41) aa ¢d, 
| this becomes Bae, 


cd 1 N N. N 
4 a OxP (01. %1 + IL aa (#0— 2 eat) 2a (eu x vu) edie x sy/W) 7 
N 
+ 2au(aa— zalN) +... $0(20— >»  ayin)| 4S 


where by{ = 1, ‘or 


Therefore, — ——— for all values of 1, we have 





rity dxq, 


a a 
a,——M+a a a REE —M+...+a,—M 
‘s a 1855 Oar, 4 Bay, ik Baty, 


on Fp <...4 
N—-1 0a; N- ‘fen 7 ii ; N- 
It is thus clear that we may write down the following —"s of simultaneous equations: 


Oey Ais a 4 oo 
(20-555) ec M+ (20 ~ widest + (2 N- rs et +(20- No i)iea™= 7 wr 
=0 (t+)), 
(5) 








7) 
ide, + Mee 





where 4,9 = 1, 2, coey KS Ay = A543 Hes =X yqe 


xO 8 . eee , 
Solving the k equations in Mia’ = 1,2,...,k, obtained by giving ¢ the k different values from 
ie 


1 to k, it clearly follows that 


rr gr 














12 y_l4el 
Mia, ~ [A]? 
2 1 
A 1 Ay yuo1™ On — 5 Me 
1 2 l 
where |A*|= ca ee Rc Dd 
1 1 2 
a <a Ay =": eee Apr y-1% 
and where | A}, | = | A¥,| is the co-factor of the jth row and rth column in | A*|. 


t $1 --- (b;) .-- dy means that ¢, is left out in forming the product of the other N—1 factors ¢,. 
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. a 2 : 
Further, since aa, |4°! =— Vay |Aeli Gr = 1,2, ...5k), 


it follows that log M = — ad 





=" log | A* | +1og 0, 


and M = C|A*|-*-), where C follows immediately from M(0,0,...,0) = 1. Next, to obtain the 
probability function y= yf (v4), V,95---» Vzx)s let us write down the differential equations in y corresponding 


to the system (5) above, using the relations similar to (3) above and also that tty = M is the m.g.f. of 
ir 


a a ahr 
Gp, Ce) - 5 ¥ (19) 


a 4 s 
= — 5 VV (t =9). 


Thus (eno Vong Pty Yt... +0as—#) [—1) | 
+ (Aap + int wee FOV) Y= < (i=) 
=0 (i+)). 
M1 «Vg On 
Clearly, writing |V|= M2 Veg +++ Une ; 
Un Var Ver 


1 8 
Y We 


and | V;,;| as the co-factor of v,;, we have, on solving the k equations in y (r = 1,2,...,%), obtained 
by giving j the different values from 1 to k: 


38 5, Be Ral lh Deel 7) 








Y dv,, 2|V| 
te) 
where —|V|=| Val, 
pI V = Pal 
" 1 @ (N —k—2)|V,,|—(N—l)a,| V | 
and for i+r, —-—y= ’ 
you,” i 


a 
h —— = 3|V,|. 
where av, | V| | Ver| 
On integration 
logy = #(N —k—2) log| V | —4(N — 1) (ayy 41 + 2919+... + Ope Vex) + log OC’, 


or y= C’| V |-*-Dexp { — 3(N — 1) (ay, Oy, + 2ayg% 19+ --- + OeeVex)}, 
where C’ is a constant. 


Quality control systems based on inaccurately measured variables 
By W. M. LONG 
Central Research Establishment, National Coal Board 


Cases sometimes arise where the characteristic which determines the quality of a manufactured product 
cannot conveniently be measured directly, but instead must be estimated from the value of a related 
subsidiary characteristic as, for example, when the direct test of quality involves destruction of the 
article. The relation between the two characteristics will often be of a more or less loose nature, so that 
the estimation of the main characteristic from the subsidiary will be accompanied by errors, in which 
case we may sum up the situation by saying that the measurements of quality are inaccurate. 

In the following note are discussed the measures necessary to meet this situation. 
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1. Let x be the estimate given by the approximate method of measuring quality of y, the true value. 
Assume that 


(i) the regression of xz on y is linear; 


(ii) the successive observations x from the production system, with their associated y’s, generate the 
bivariate normal distribution 
1 1 
Pmex SF ee, a —g2 
no OoP — 958 +9? —2 V(1—0%) zy} dady, 
the variates x and y being measured in terms of their standard deviations from their mean values for the 
system; a? is the residual variance of the regression of z on y and is equal to (1—,*), where p is the 
coefficient of correlation between x and y. 
The conditional distribution of x, given y, is then 


N{ (1-07) y, 0%}. (1) 


Clearly in the bivariate system any change in the marginal distribution of the y’s must be accompanied 
by a corresponding change in that of the z’s, provided that the distribution (1) remains unaffected. We may 
therefore use the marginal distribution of the x’s as a check on the stability of the production system; 
i.e. we may base the quality control system on the «’s, precisely as if they were direct measurements of 
quality. 


2. It is not of course to be expected that a control system based on the z’s will be as efficient as one 
based on the y’s. To see how the efficiency is reduced we may consider the respective powers of the z-tests 
and y-tests with regard to a set of simple hypotheses. Instead of considering the number of defectives 
in the test sample, as is usual, we shall for convenience deal with the displacement of the sample mean 
from the hypothetical mean for the system; the principle of the argument remains unaffected. 


Let the acceptance region (or one of them) in the test be the interval (- 7 =) » where 


— Jn’ Jn 
eal ai = (1—a), say, 


the test being based on the mean of n observations, so that if a sample mean 7 falls outside the above 
interval a change is assumed to have occurred in the production system and action is taken. 

Suppose a change has occurred, in that the average level of quality has shifted from 0 to , the variances 
of x and y, and the conditional distribution (1) remaining unchanged. This implies a shift in the x-mean 
from 0 to ./(1—o*) € = pé, the correlation coefficient remaining unchanged. 

The power of the x-test in detecting the change is 


Lol 
- /F " e-in@-pi'dz = |] —-—_— 
27) alvin (27) —%y—-Vnp— 


The effectiveness of the x-test, compared with that of a test based on exact measurements, may be 
described by the ratio 


Ot lage e~te"de, 


1 Lo— Vn p& 1 [%-vng -1 
1—-—_—__ Waal {1-5 -~Was| . 
{ (27) anda (2m) —X-Vn Ee 


the denominator being the power of the test when p = 1, i.e. when the measurements of quality are exact. 
We shall call this ratio the ‘power factor’ of the z-test; the analogy with the definition of the efficiency 
of an estimator is obvious. The power factor is in this case a function of p, n, « and &. 

In Figs. 1 and 2 the full lines show how the power factor varies with the quantity ./(n£), for « = 0-01 
and 0-001 and p = 0-6, 0-7, 0-8 and 0:9. Itis unity at § = 0, +00 and —0o (this follows from the definition), 
and for values of p in the range considered has minima near ,/n£ = 2, i.e. § = 2/,/n. 

The practical significance of this may be gauged from the dotted lines on the diagrams, which show 
how the power of the exact test (case p = 1) varies with ./ng. Clearly the value of the power factor, for 
né less than 2, say, is not of great interest, since the power of the exact test itself is then low. 

The power lost through the inaccuracy of the measurements may be restored, at least partially, by increasing 
the size of the test sample. For the power of a test based on n, exact measurements is the same as that of 
one based on n, inaccurate measurements if ./ngp = 4/m, i.e. if ng = n,/p*. It will not of course always 
be possible to recover all the lost power in this way, since n, is limited in size, the limit being reached 
when all the output is being examined. 
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3. The joint distribution of x and y depends on five parameters, the means and variances of x and 
y respectively and the correlation coefficient p. So far in our control system we have considered one only, 
the x-mean; a second, the variance of x, may be covered by the ordinary quality control methods which 
include a check on the variance of the control variable based on the sample range. This leaves three 
parameters, and it will be seen that these are covered by our assumption that the distribution (1) remains. 
unchanged. 

This assumption will often be a reasonable one to make, since the relation between x and y will usually 
be a consequence of the fundamental natural properties of the material, properties which may be quite 
stable and insensitive to changes in any particular system of production. 

In some cases, however, it will be advisable to carry out periodical checks on the stability of the dis- 
tribution (1). This will require special samples in which both x and y are measured. The following is an 
appropriate significance test. 
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Fig. 1. a=0-01. Fig. 2. «=0-001. 


Let (x;,,y;) be a typical pair of observations from the test sample, there being 7 pairs in all, say. If 
n 
the regression system is stable then S = & (x,—py,)*/o? is distributed as x? with n degrees of freedom. 
i=1 
Let (x—Z) = b(y—¥Y) be the line of best fit for the sample. Then 


X(x,— py;)*?/o* = XU[x,—Z—b(y, —¥) ?/o? + n(Z— py)?/o? + (6—p)* L(y,—¥)?/o* 
= S; + S; + S3, say. 


The quantities S,,S, and S, are, under the assumption of regression stability, independently distributed 
as x* with degrees of freedom (n—2),1 and 1 respectively. By entering the values obtained in the y* 
table with the appropriate degrees of freedom they will therefore provide tests for changes in the residual 
variance and the two parameters of the regression line, respectively. 

We should also test S itself, for S may attain a significant value even though each of its components 
fails to do so. In this we should still conclude that something was amiss, without being able to specify 
exactly where among the various parameters a change had taken place. 

A little care is necessary in judging the results; S, shc uld be tested first, for if a change in the residual 
variance has occurred the tests for the other two parameters no longer apply. In this case the appropriate 
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tests for the line parameters are the F'-ratios F,,,, = (n— 2) S,/S; and F, ,_, = (n—2)S8;/S,, where the 
affixes have their usual meanings in terms of degrees of freedom. A collective test 
Ps, n-2 = (n— 2) (S,+S83)/(28,) 
may also be applied, for reasons similar to those which lead us to test S in the above. 
4. From the above discussion of the power factor it will be apparent that there is nothing sacrosanct, 
for. quality control purposes, in direct measurements of quality, even though to obtain them may be 
perfectly practicable. The same power in testing may often be obtained through the use of larger numbers 


of indirect measurements, and the latter procedure will sometimes be preferable on the grounds of 
convenience and cheapness. 


The bivariate normal integral 
By J. H. CADWELL 


1. In order to evaluate the bivariate integral over a rectangle with sides parallel to the axes, we can 


use the function 
1 sis i (x* — 2pay + y*) 
M(h,k, p) = ———_— ——————__ dxdy. 
aay mols A eae 


Tables will be found in Tables for Statisticians and Biometricians, volume 2 (Pearson, K. (1931)). 
Linear interpolation is adequate for small values of p. However, for | p | > 0-5, second differences in p are 
needed for four-figure accuracy. 

If the rectangle R is bounded by the lines 


w=h, toh, yok, y=k, 
the integral over R will be equal to 
M(h,, k,,p) + M(hg, ky, p) —M(hy, kg, p) — M (hq, ky, p). 
The following results are needed for negative h and k: 
M(h,k,p) = $— ®(h)—M(h, —k, —p) 
= M(—h, —k,p)+ ®(—h)+ O(—k), 


where (x) is the normal integral from 0 to z. 

2. We show below that the bivariate integral over R, or, indeed, over any polygon, can be evaluated 
by using the function 1 fh pach 

Vina) = 5 | [exp —Hat +¥") dey. 
27 0/0 

As this depends on only two variables, a much smaller table will suffice. Such a table, giving 6 decimal 
places for V(h, q), has been prepared by Nicholson (1943) for arguments h = 0-1(0-1) 3-Oandg = 0-1(0-i) 3-0. 
Should this table not be available, the approximation derived below will give three-place accuracy for all 
values of the variables. 

Nicholson also found an asymptotic formula; this will give results of high accuracy for large h and q. 
An alternative derivation is given below. 

We see that V(h, q) is the bivariate integral with p = 0, taken over the triangular region defined by the 
points (0,0), (h, 0) and (h,q). By virtue of circular symmetry, the integral will be unaffected by rotation 
of this triangle about its vertex at (0,0). The result 


O(h) D(k) = V(h, k) + V(k,h) 
follows readily. On putting h = k, and using the approximation for V, an improved version of the result 
1 Qar?\ \+ 
O(2)=5 (1 —exp (- =)| 
is obtained. 
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3. In order to evaluate the integral (9 +0) over R we proceed as follows. 
Using the transformation 


w=x, z= (y—pz)/(1—p*), 
the integral over the shaded triangle is found to be 


ki—ph, |. 
| V(has Qu) — VAs 9n) |, a = ee (¢ = 1,2). 
A similar transformation gives for the triangle OAB 
hy— , 
| Viki» Pu)— Viki Pa) |, Pa = ie (@ = 1,2). 


For negative values we need the results 


V(h,k) = — V(h, —k) = — V(—h,k) = V(—h, —k). 


B (hy. k;) A (hy. ky) 











ae 


J _ ie 
13411 


D(h,, kz) 








Fig. 1 
In the above figure the two integrals are to be added, and two similar integrals taken from their sum. 
When the origin lies inside R, all four integrals will be added. 
With the obvious definitions of p,,, etc., the integral over R is 
| Vy 911) + V(Peas Ga) + V(ky, P11) + V(Keas Pag) — V (hy, G21) — V (hres a2) — V(x» Por) — V(keas Pra) |. 


However, it is more convenient to evaluate the four portions separately, and then combine them suitably. 


4. Using the tables for V(h,q) we can evaluate the general integral over any polygon. A rotation of 
axes followed by a change of scale will reduce ths problem to that of evaluating 


l 
s;| | exp— Het tv )dedy. 





This integral is the sum of a number of integrals over areas of the type shown shaded in Fig. 2. 
iss | 
*K ‘ 
Bd qu q2 
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Fig. 2 


Because of the circular symmetry, this area will contribute 


V(h, 2) re VA, %)- 
Thus the problem is solved. 

















Miscellanea 477 


5. In order to obtain an approximate formula for V, we replace the triangular area by the sector 
of a circle shown in Fig. 3. The integral over the sector is 


a 
5 {1-exp(—4R%)}. 


(h, 9) 


x\ 


a Y 








A 
Y 





R > 
Fig. 3 
Throughout X the integrand is smaller than at any point of Y. Thus the integral over the triangle 
will be smaller than that over the circular sector. However, if « is not too large, we assume that the in- 


tegrand has the same constant value over X and Y. Then we choose R to make X and Y have equal 
areas. This amounts to making the triangle and sector have the same area and gives the approximation 


tan" q/h —hq 
‘h, q) =———-_{]l-— —————_ }. 
¥(h.9) 27 ( exp 2tan-} a} 
For values of g>h we evaluate V(q,h) and use the result 
V(h,q) + V(q,h) = D(h) O(q). 
The error of this approximation, after a change of sign, is 
1 f@ —h? aw-2(1 . —h*? tana 
2m 9 » 2c0s*6 ~ Oar Poa ‘ 


Differentiating with regard to a and h we find that it is a maximum (in the region q<h) for 


2 


An approximate solution gives h = 1-76, with a maximum error of 0-0015. Fig. 4 shows the error 
graphs for g = h and g = 0-75h. It is evident that, in general, errors will be much less than the maximum 
value. The value given ky the approximation is always too large. 


h 21 
= . —* _exp—h? (=-3) = O(h). 












Error 4 The approximation 
__tan-ig/h —hq_ 
0-001 +— q=h Vib. Q=—z, {1 —°*P Fean-igih 
0 bette 
0 1-0 2:0 3-0 40 h 


Fig. 4 


6. To improve on this approximation we expand the integrand about R as 





31-2 
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Now we choose R so that this quantity gives equal and opposite values when integrated over X and Y. 


This gives the following result: 
4hq(1+4R*) —hg(h? + 49g?) = 4R%a(1+4R*)—aR*. 
Assuming that R? = hq/a+e 
with e small, we derive the approximate formula 
2 2 aes 2 
niaas ={1- exe | = —h ne, ys tena (oy ee 3 tan =: 


This formula is not asymptotically correct because of the neglect of powers of € in deriving it. However, 
over the range, 0<h<3-5, the maximum error is 0-0003 at h = 2-4. Beyond 3-5 the simpler formula gives 
a smalier error that tends to zero rapidly. 

The correction term is troublesome to compute and is not needed for the small g. This suggests expanding 
the correcting factor about h = q, and gives the following rule for the exponential index: 


If h<3-5 —hq 
0-75h<q<h 2tan-gq/h 





+ 0-04h%q — 0-03h4, 
otherwise the simpler form is to be used. This procedure will never give an error greater than 0-0005, and, 


in general, will be much nearer the true value. 


7. We now derive an asymptotic expansion due to Nicholson (1943) for the integral 


I(h,q) = mel, So ~ 24 aedy, 
az 


By considering integrals over appropriate regions we see that 


I(h,q) +4®(h) = V +—cot-1= 
(h, q) + $O(h) (A, 9) 2 cot h’ 
Transforming to polars and using an appropriate change of variables, 


1 fin h? 1fe 1 —hX1 +2) 
Wha) = 5 [ oP - sarp = | im ss 





Here « and z are defined by x = tana =<. 
Repeated integration by parts then gives 





| pel —< 1 1 2 )+ 
~2n° area “planner are |; 


The general term inside the brackets is 


1 /(1d\""/ 1 ) 
mea) basa 


Putting x? = ¢ and using the Leibnitz formula gives 





(—1)"-1 (2.4...(2n—2) =21.3...(2r—1)(2r+2)...(2n—2) 1.3... (2n—3) 
h2n { 2( 1 + ay 1 Fata | 1 + z*)*-" vin-l(] +2) 


If S,, denotes the sum of n terms of the expansion, we have the relation 


1 f®d/ld\"; 1 AY +222) 
1= 8.4 55 | zz) (asa) P- 2 > 


From the formula for the nth term we see that this integral is of alternating sign. Thus, S,, will be 
alternately too big and too small, since the integral will take alternate positive and negative values. 
Beyond a point depending on h and 2, the terms begin to increase. We thus have the rule, that if the 
rth term be smallest, J will lie between S,_, and S,. Further terms cannot improve on this information. 
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8. The well-known approximation discussed by Pélya (1949), 


1 — 2a2\i 
O(z)=5 (1-exp—), 





arises from the formula of §5. Its maximum error is 0-003 at x = 1-6. Using the improved formula of §6 
1 2a? 2(7—3)a*])* 
(x) =5(! —exp |-= + ae || . 


We note that this formula is not asymptotically correct. However, over the range 0<2<3-5, the 
maximum error 0-0007 arises at x = 2-5. 
While the method used could be carried a stage further, it is easy to make empirical adjustments. 


The formula Ont 2 3) <4 
— 22 4 a9) 2" _ 9.000528 + 0-0000228 
1 37? 
for the index of the exponential gives a maximum error of 0-00005. 


Acknowledgement is made to the Chief Scientist, Ministry of Supply, for permission to publish this 
paper. 
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Note on the variate differences of autoregressive series 
By BENTO MURTEIRA 


1. The object of the present note is to give a simple expression for 


k. 
var (A*z,) a tan Fy (1) 
k-> (*.) k->0 
k 


when 2; is a series generated by an autoregressive scheme of order m, 
Up Oy Lypiy tH ... +X Lm = Cp (2) 


We assume that H{e,¢,,,} = 0 for v+0, and in this case it is well known that 


var (A*e,) = vare (7) 


,( 2k = (3) 
cov (A*e,A*e,,,) = - wea Fat mits 
0 k<s. 


Let us consider for fixed k the autocovariance generating function of the series A*x, given by 
Gy(z) = A(z) G,(2) 


ioe] co 
= LC = OMew 


uUu=—@ v=—0o 


= 5 Oe, (4) 


s8=-@ 
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1 1 
(Lay Z+ 2. FO 2™) (L022 +... +O Z-™)’ 
OF = H{A*e,A*e,,9}, 
OP = E{A*x,A*x,,,} 
and var (A*x,) = 0. 
The expansion of G(z) as a power series in z is to be regarded as a purely formal one determining the 


C’s in terms of a’s. It will, however, converge for z = + 1 if the series is stationary, for then all roots of 
the equation t” + a,t"-14...4+a,, = 0 are less than unity in modulus. 





Gz) = 














We can see easily that = 
oy = 2 0,c® 
8=-@ 
ao 
= 2 0,0%, 
s=-@ 
2k 
‘ ow ‘ k-—s 
and lim ——- =vare 2 C,(—1)* lim 
kw (2k s=—@ f k>o (2k 
k k 
@ 
=vare & C,—1)* 
s=-@ 
= vare G(—1) 
= vare[l1—a,+a,—...+(—1)"a,,]-*. (5) 
2. This result was applied to Kendall’s (1946) experimental series: 
Series 1 Series 2 ~ Series 3 Series 4 
Vo 2535-11 3414-42 3900-94 2001-20 
Vi 604-10 523-87 1472-92 3377-47 
V; 278-79 239-46 523-63 3958-74 
Veo 102-14 133-23 101-51 3415-88 
Limit (equation (5)) 130-01 112-56 104-62 3335-33 























The agreement seems fair if we note that our result is for an infinite series. It is interesting to see that 
for series 1 the expected value of V2, is 137-66. 

It is also of some interest to note that for a ‘wandering’ series, when the a’s, except «,, are zero and 
a, = —1, the limit (5) is infinite; but that for a, < —1 the limit is finite. In the latter case, however, the 
series is not stationary and the result fails to hold. 


3. The generalization to autoregressive schemes with moving average errors such as that below, 
Wg + Oy Vyig toe + Om Lpim = + Py t--- + Babin (6) 
is straightforward. The formula (4) still holds, 
G,(z) = G(z) G(z), 
1+f,z+...+ 2,2" 14+8,27+...+8,2-" 
1+ Z+... + O_2™ 1+ a,272+...+0,2-"" 
The same reasoning as above leads to the conclusion that 
lim V, = var e G(—1) 
ko 


at weiie [ ape ee 
1-—a,+a,—...+(—1)"a,, 


but now Gz) = 








(7) 
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Note on the inversion theorem 


By J. GIL-PELAEZ 


If F(x) is a one-dimensional distribution function, its corresponding characteristic function is the 
complex function of the real variable ¢: 


d(t) = i) a ett dF (x). 


The theorem which proves that the distribution is uniquely determined by the characteristic function 
is called the inversion theorem. This was first demonstrated by Lévy, who found the expression 


j— e~ttz 


t) dt. 
aH) 
The constant F(0) is no obstacle to an ascertainment of the distribution, because by subtraction we have 


for every interval (a, b) b 1 [@ ¢-tta_ ei 
f “aF(@) = = i) ae Hinde 


Furthermore, when the distribution belongs to the continuous type the constant is also eliminated by 
differentiation, 





l f- 3] 
F(x) —F(0) = =| 


fe) = F(a) = [" eeepyae. 


This is sufficient in the majority of cases, but when we are interested in the explicit expression of the 
distribution function, the constant F(0) is a substantial inconvenience. I therefore present a derivation 
of the inversion theorem by a new method which has the advantage of evaluating the constant term. 
i yi t ; : 
I shall, in fact, show tha tin’ hy 1 2 tag —1) —e-HeG(t) 
=, 2 27 0 at : 





In order to prove this, let us consider the step-function 


. -1l if y<z 
. 2 (° sint(y—z é ° 
sign (y—a) =={ —tyns) Land © # yes, 
0 1 if y>a, 
foo} 
and observe that sign (y—x)dF(y) = 1—2F(z). 
—-@ 


Now, for any positive numbers ¢,A, we have 


A eitad( — #) — e—ita A fo —it(y—2)__ pit(y—2) 
gen ees gd gn 
Tle ut Tle J—o u 


A gi - © Asi de 
=-={"{ te=) aryat = — [ ar)? | sinty—2) 5 
Seg Fla = 4 cies os. 8 
The inversion of the order of integration has been possible because for e<t<A and —«a<y<o 
|=4-2 
t 





1 
poh 
€ 





When ¢ tends to zero and A tends to infinity, we have 


1 wy = f° dF yim =f” siny—2) ., 
. oy oe y WJ t 





TJo u 


x 
=~ [° sign (y—2) arty) = 2F(2)-1, 
—-@ 
which establishes the theorem. We observe that it is possible to take the limit inside the integral sign 


because [nome 
t 
é€ 


is a continuous function of ¢, A with a bounded modulus. 
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When the characteristic function is of the following kind, 
P(t) = Zp, e*, 

r 


the formula of the inversion theorem gives us the following function: 


1 1 (?% Xp, e%*-)— > p, e-s-2r) 
F(z) = sta] r “sit dt 
it 





a 1(*sint(z—z2z,), 1.1 ’ 
=5+5e2[" ; dt = 5 +52 Pr sign (2 2). 


This is a step-function with a saltus p, at x, We must observe that at the points of discontinuity the 
value of the function is the half-sum of the limits on the right and on the left. This is not the value if the 
distribution function is taken to be continuous on the left, but depends on the Dirichlet integral which 
has been used in the proof. 


I have to thank Mr D. G. Kendall for his helpful suggestions. 


A note on the statistical independence of quadratic forms in the analysis of variance 


By J. A. NELDER, National Vegetable Research Station 


A widely used notation in writing out the algebraic identities in the analysis of variance is that employing 
multiple suffixes and using dots to denote averaging over the suffices concerned. Thus with p x g two-way 
classification (to take a simple example) we have 


2 af = La* + D(a, —2z,)?+ Ule,;—2,)*+ U(ay—zj,,—2,5+2,)*. (1) 
i,j iJ i,j 4,9 iJ 
Let us denote the four linear forms in the z,,, namely, 
iy My — 2, Ly-U, and y—X, —X 4+, 
by Yrss> Yass» Yois ANA Yas 
respectively. Then writing x, y,, Y2, Ys» Yq for the five pg x 1 vectors 
{xs}, {Yres}s {(Yass}» {(Yaesd» {Yeast 
and putting y, = B, x we note that 
(i) B, is symmetrical for all a, 
(ii) B,+B,+B,+B, = I, since 


Uy SXF (xy, —X,)+(Ly—X,) + (Xy—Xj,—L y+,), 
(iii) B, Bg is skew symmetric, a + f, since each cross-product such as 


2 (x 3—2,.) (2-4-2, 5+2,) = 2B, Byx 


vanishes identically in the 2,;. 

Now, although accounts of the analysis of variance note the separate vanishing of each of the cross- 
products in (1) and similar identities, it does not seem to have been poir+:-! out that when the 2;; are 
independent normal variates with zero mean and unit variance, the conditions ,i), (ii) and (iii) above on 
the B matrices are sufficient for the quadratic forms on the right-hand side of (i), i.e. 


(B, 2) Bx = 2'B,B,x =2'Bix (a = 1,2,3,4), 


to be independently distributed as y? and that the number of degrees of freedom associated with the form 
x’ Bix is trace B,. Thus the number of degrees of freedom of ¥ (x,,—2;,—2,,;+2,,)* is given by 
4,3 


7 


1 
trace B, = (1-2-2 +) = pq—p—q+l 
«= Pq q Pa-P- qr is 


a familiar result. 
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Since this approach leads to easily verifiable sufficient conditions for the independence of the quadratic 
forms in the analysis of variance when expressed in the above notation, and to a simple method of 
evaluating the number of degrees of freedom associated with each such form, it may be worth giving 
@ proof. 

The result follows from the following theorem in matrix algebra, contained implicitly in a paper of 
A. T. Craig (1943): 

If 


(a) A,,...,A, are n real symmetric N x N matrices, 

(6) Ay+A,+...44, =], 

(c) A,;A; =0 (¢ = 1,2,....n3 7 = 1,2,...,3 ¢+9), 
then the A; are idempotent and there exists an orthogonal matrix H such that non-zero parts of H’A,;H 
are disjunct diagonal submatrices with unit diagonal elements. (It may ‘be noted that this is Cochran’s 
Theorem (Cochran, 1934) as given in matrix notation by Aitken (1950) except that we have substituted 
condition (c) for the condition that the sum of the ranks of the A, equals N. The two conditions are in fact 
equivalent.) If x represents an N x 1 matrix of independent normal variates with zero mean and unit 
variance, then by standard normal theory the quadratic forms x’A,x (= 2x’A?z, since the A’s are idem- 
potent) are independently distributed as y? with degrees of freedom given by trace H’A,H = trace A,, 
since H is orthogonal. 

The B matrices, however, in identity (1) are not immediately seen to satisfy condition (c) of the theorem. 
In fact, the skew-symmetry condition (iii), in conjunction with conditions (i) and (ii), implies this as will 
now be shown. : 

Pre- and post-multiply by A, the relation 


A, +A,+...44, =I 


and add giving 243 +(A,A,+A,A,) +... +(4y4n+4_A3) = 2433 
from (i) and (iii) A, A,+4,A, =...= 4,4, +4,A4, = 0, 
therefore A? = A, and generally A? = A, (i = 1,2,...,n). 

Now since A,A;+A,A,=0, 
therefore Aj A,+4,A,A,=0, 
and so A}A,= A,A,; = —A,AjA,y 


which is symmetric. Hence A;A, is both symmetric and skew symmetric and so is equal to the zero 
matrix. Thus in conjunction with conditions (i) and (ii), condition (iii) implies condition (c) and the 
result follows. 
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REVIEWS 


Statistics, volume u. By N. L. Jonnson and H. Tertry. xi+318 pp. Cambridge 
University Press. 1950. Price 20s. 


This volume represents the completion of the new text-book designed to cover the statistical requirements 
of Parts I and II of the syllabus of the Institute of Actuaries Examinations. The first part, which was 
recently reviewed in Biometrika, vol. 37, p. 453, dealt with the methods of descriptive statistics, questions 
of statistical inference and the simpler statistical tests. This second volume deals with the more exact 
theory of sampling, statistical tests based thereon, including the general theory of statistical tests, 
analysis of variance, correlation analysis, curve fitting and graduation. 

In preparing this second volume the authors have had to keep in mind the mathematical equipment 
of the students for whom the book has been primarily prepared. As this precludes the use of complex 
variable analysis they have been unable to utilize the various elegant derivations and proofs which 
follow from the use of characteristic functions. Similarly, a significant omission from the book is any 
reference to generating functions and moment-generating functions in particular. Whilst the omission 
of these and other special techniques does not prevent a successful mathematical development, some 
direct reference to them and their application would have added to the presentation. 

The chapter numbers follow ou from the first volume and Chapter 11, which is appropriately headed 
the calculus of distribution functions, is based on random sampling from a normal population and develops 
the theory of the comr-yvner distributions, including x7 t, and F,, ,,. The practical application of these 
distributions in deriving significance levels instead of the approximate methods adopted in Chapter 10 is 
developed in Chapter 12. 

Chapter 13 extends the earlier work on the binomial distribution to the multinomial distribution. 
This leads naturally to the problem of ranked variables and to the distribution of the range and of the 
median. The concept of power is introduced in Chapter 14 with a discussion on statistical tests and the 
meaning of likelihood. 

After a short discussion on the various principal sampling systems the essential features of analysis of 
variance are dealt with in Chapter 15 and the principles of correlation analysis follow in the next chapter. 

The final chapter is devoted to the general principles of curve fitting and graduation, including the 
question of goodness of fit. 

As in the first volume each chapter concludes with a number of examples, many of which cover 
important results which could not be included in the text. They thus vary in difficulty between fairly 
wide limits. Some mathematical points have been relegated to appendices, which also include tables of 
x?,t and F. A very valuable feature is the provision at the end of each chapter of a fairly comprehensive 
bibliography related to the subject-matter of the chapter. This inevitably involves a repetition of titles, 
but will be of considerable value to the student who requires more extended treatment of a particular topic. 

With the completion of this second volume it is now possible to obtain a proper perspective on the 
manner in which this text-book is likely to influence the training of statisticians and of actuaries in 
particular. In general, text-books on statistics fall into three main categories: (i) elementary books which 
treat mainly with descriptive statistics and provide little but general guidance, (ii) handbooks which are 
designed mainly for practical users who may not be concerned with or trained for the formal development 
of the statistical tests and techniques, and (iii) advanced texts requiring considerable mathematical 
ability for their proper understanding. This book clearly aims to provide a course from which the essential 
principles of modern statistical methods can be properly appreciated, so that the student will be in 
a position either to use books of class (ii) with a proper understanding of the fundamentals or alternatively 
to proceed to books of class (iii) without undue difficulty. The book must be said to have achieved this 
aim and the student who has worked through the text and the excellent examples should have a balanced 
outlook on the subject. Statistics is essentially a practical subject, and in the reviewer’s opinion this book 
strikes a happy balance between practice and the underlying theory. 

A few unimportant misprints have been noticed in the text, but some errors have been noted in the 
illustrative examples of which the following are important: 

On p. 45 the mean of Ex. 12-1 and & of Ex. 12-2 are incorrect; these are unfortunate errors, as 
these two examples are used throughout Chapter 12 for illustrating various statistical tests and thus give 
rise to other errors. On p. 215 the values of m, and m, should be — 3-9076 and 136-0218 respectively, with 
consequential minor adjustments to Tables 17-2 and 17-5. On p. 220 in Ex. 17-3 the right-hand side of 
the first equation should be 3253, which renders the solution of the equations in error. 
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Various errors have been found among the exercises at the end of each chapter and the following might 
give rise to some difficulty to a student. 


Question 
Exercises no. 
1l 1 Solution should read T'(/+m)/T(l) T(m). 
11 11 Question should read (w — 1) (w + 2)?. 
12 12 For 1-66 and 2-80 in solution read -83 and 1-40. 
13 8 For a, in solution read 2-829 in place of 1-629. 
15 7 For 1-24 in solution read 1-113. 
16 16 For -3904 in solution read -435. 
16 17 In solution 4 and 264 degrees of freedom should be 7 and 261 respectively, etc. 
17 5 For 8-84 in solution read 26-52. 
17 6 For 78-48 in solution read 235-44. 


However, these and the other blemishes can be corrected in a second edition, which this book should 
justify. R. E. BEARD 


Probability and the Weighing of Evidence. By I. J. Goop. vi+103 pp. Charles 
Griffin and Co. 1950. Price 16s. 


The aim of the author has been ‘to provide a consistent theory of probability that is mathematically 
simple, logically sound and adequate as a basis for scientific instruction, for statistics and for ordinary 
reasoning’. The author maintains that subjective probability judgements must be given a recognized 
place in the fundamentals of probability and builds his theory on this basis. 

At the beginning we have a brief critical review of various theories of probability which have been 
put forward, including the axiomatic approach which is adopted by the majority of workers in the 
probability field. This is followed by a discussion of the choice of axioms suitable to act as the basis of 
a theory, and a mathematical formulation of these axioms, with the attendant theorems. Applications of 
probability theory to the weighing of evidence and to general statistical theory are discussed at some 
length 

The corpus of statistical theory has reached such large dimensions during recent years that it is well 
for statisticians to have reminders, such as this book, of the comparatively insecure foundations on which 
such an imposing theory is built. It is fair to say that statisticians in general will not accept Dr Good’s 
approach to statistical problems, although those who give Bayes’ theorem a prominent part in the theory 
of estimation may be stimulated by his work. F. N. DAVID 
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